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GENS MUTATED IN WOLFRAM SYNDROMS 



FIELD OF THE INVENTION 

This invention relates to the field of 
diagnosis and treatment of certain forms of diabetes. In 
particular, this invention provides a novel gene, the 
disruption of which is associated with Wolfram Syndrome, 
and methods of using the gene and specific mutants 
thereof as diagnostic tool for prediction and early 
detection of the syndrome. 

BACKGROTOJD OP THE INVENTION 

Various scientific and scholarly articles are 
referred to in brackets throughout the specification. 
These articles are incorporated by reference herein to 
describe the state of the art to which this invention 
pertains . 

Wolfram syndrome (WPS) (OMIM 222300) was first 
described in 1938 as a combination of familial 
juvenile -onset diabetes mellitus and optic atrophy. 
Other clinical features subsequently emerged and 
accordingly, WFS is also referred to as the DIDMOAD 
(diabetes insipidus, diabetes mellitus, optic atrophy, 
and deafness) syndrome. Most patients with this 
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progressive disorder eventually develop all four cardinal 
manifestations, and die prematurely with widespread 
atrophic changes throughout the brain. Insulin- requiring 
diabetes mellitus occurs with mean age of onset at 6-8 
years. When examined, pancreatic islets display atrophic 
and insulin-producing p-cells selectively eJDsent . The 
disease is believed to account for 1/150 patients with 
young-onset insulin- requiring diabetes mellitus. 

The pathogenesis of Wolfram syndrome is 
unknown. Diagnosis is usually made in offspring of 
unaffected often-related parents, suggesting autosomal 
recessive inheritance. Linkage of the gene to markers on 
chromosome 4p (Polymeropoulos et al . , Nature Genetics 8: 
95-97, 1994) was reported, and later confirmed (Collier 
et al.. Am L Hum Genet 59: 855-863, 1996). Recombinants 
placed the gene in an interval between- markers 5.5 cM 
apart . 

Isolation and characterization of the gene or 
genes associated with Wolfram Syndrome is vital for the 
prediction, diagnosis and, ultimately, treatment of the 
disease. Currently there is no way of knowing for sure 
if an individual is predisposed to Wolfram Syndrome, 
particularly children too young to have developed the 
characteristic symptoms. Early diagnosis may lead to 
effective methods of treatment. Identification and 
isolation of the gene or genes associated with Wolfram 
Syndrome would further enable the development of 
screening procedures to assist in genetic counseling, as 
well as enabling detailed study of its function and 
subsequent development of therapeutic methods or agents. 

SUMMARY OF THE INVENTION 

In accordance with the present invention, a 
gene is provided, whose mutation is highly correlated 
with Wolfram Syndrome in humans. The symptoms of Wolfram 
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Syndrome consist of diabetes insipidus, diabetes 
mellitus, optic atrophy and deafness. This gene is found 
in humain genome between markers D4S500 and D4S431 on 
chromosome 4p, and is referred to herein as WFSl, One 
variant of WFSl is found in SEQ ID NO:l. 

Another aspect of the invention comprises 
several isolated nucleic acids from the human and mouse 
WFSl genes. In a preferred embodiment, these nucleic 
acids are the genomic sequence of the WFSl gene from 
human (SEQ ID NO:l) , a cDNA sequence from the human WFSl 
gene (SEQ ID N0:2) , and a cDNA from the mouse WFSl gene 
(SEQ ID N0:4) . In another preferred embodiment, the 
isolated nucleic acids are substantially the same or 60% 
homologous to SEQ ID NO:l, SEQ ID NO:2, and SEQ ID NO:4. 
In yet another preferred embodiment, the nucleic acids 
encode SEQ ID NO: 3 and SEQ ID NO: 5, substantially the 
same variants of SEQ ID NO: 3 and SEQ ID NO: 5, variants 
with at least 60% homology to SEQ ID NO: 3 and SEQ ID 
NO: 5, and variants of SEQ ID NO: 3 with the mutations and 
polymorphisms listed in Table 1. In a final preferred 
embodiment, the isolated nucleic acids are 
oligonucleotides (SEQ ID Nos:6-41) that have been 
designed based on SEQ ID NO:l, 

In accordance with another aspect of the 
invention, a selection of isolated polypeptides is 
provided, which result from the expression of part or all 
of the human and mouse WFSl genes . In one preferred 
embodiment, the polypeptides are encoded by SEQ ID NO; 3 
and SEQ ID NO: 5. In another preferred embodiment, the 
polypeptides are sxibs t ant i ally the same as SEQ ID NO: 3 
and SEQ ID NO: 5, or at least 60% homologous to SEQ ID 
NO: 3 and SEQ ID NO: 5. 

In accordance with another aspect of the 
invention, a selection of antibodies that are 
immunologically specific to the aforementioned 
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polypeptides is provided. 

In accordance with another aspect of the 
invention, a series of methods is provided, which use the 
WFSl gene to genetically characterize mammalian subjects. 
In a preferred embodiment, the WFSl gene is used to make 
labeled probes which are used to detect VIFSl nucleic 
acids. In another preferred embodiment, variant forms of 
WFSl are sequenced, compared to the sequence of WFSl, and 
mutations and polymorphisms determined. In a 
particularly preferred embodiment, human genomic DNA is 
sequenced and compared to SEQ ID NO : 1 . In a very 
particularly preferred embodiment, the primers in Table 2 
are used to sequence the variant human gene. In yet 
cuiother preferred embodiment, restriction enzymes are 
selected that differentially digest the wild type and 
variant gene, WFSl nucleic acids are digested and 
separated by size, and the WFSt nucleic acids are 
detected. In a pairticularly preferred embodiment, the 
human DNA is digested, and the primers in Table 2 are 
used to an^lify the DNA before digestion. 

Other features and advantages of the present 
invention will be better understood by reference to the 
drawings, detailed descriptions and exan?>les that follow. 

BRIEF DESCRXFTICN OF THE DRAWXNOS 

Figure 1. Pedigrees of Wolfram syndrome 
families, with individuals designated by disease status 
(bold-lined symbols affected, thin-lined symbols 
Tonaf fected) , and with derived haplotypes of chromosome 4p 
markers. Bold, underlined numbers represent disease- 
associated chromosomes. Italicized, underlined numbers 
refer to disease-associated chromosomes with historical 
recombinants. Consanguineous Japanese families: Fig- la, 
WS-1; Fig. lb, WS-2; Pig. Ic, 
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WS-3 (an affected daughter, not shown, was deceased 
before initiation of the study) . Caucasian families: 
Fig. ID, WS-4; and Fig. IE, WS-5. Markers were ordered 
from telomeric to centromeric as described in Exaitple 1. 

Figure 2. Physical map of the Wolfram syndrome 
critical region. Fig. 2A: The horizontal line at the top 
of the figure represents a portion of chromosome 4p, with 
the centromere to the right and pter to the left. The 
dashed line represents the interval of D4S500-D4S431, the 
critical region of the Wolfram syndrome gene. PI and BAC 
clones are represented as lines. Their length reflects 
the number of STSs and not the actual size. The name of 
each clone is given to the right of the line. Marker 
names are noted above the line and correspond to the 
symbols on the line. Fig. 2B: An expanded schematic of 
the genomic structure of the Wolfram gene, WFSl, with 
exons indicated by boxes. The entire gene is encompassed 
within a 33.4 kb region. 

Figure 3 . Expression of WFSl mRNA in adult 
tissues. Fig. 3A: Northern analysis with human adult 
polyA+ RNA (5 /zg) derived from various tissues was 
hybridized with a "P- labeled 854 -bp genomic fragment of 
the VfFSl gene. Fig. 3B: Re -hybridization of the* blot 
with ^'P-labeled p-actin cDNA as a loading control. Fig. 
3C: Northern analysis with human adult total RNA (2 0 ^9) 
hybridized with "P-labeled WFSl cDNA. Fig. 3D: Re- 
hybridization of the blot in c. with "P-labeled ribosomal 
cDNA as a loading control. 

Figure 4. Hydrophobicity analysis was 
conducted by the method of Kyte and Doolittle (J Molec 
Biol 157: 105-132, 1982) using a window size of 9 amino 
acid residues. Average hydrophobicity values are plotted 
as a function of position along the polypeptide chain. 

Figure 5, Comparison of human and mouse WFSl 
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protein aequezices. Plain text indicate identical 
residues. Amino acid gaps between h\iinan and mouse 
proteins are shown by dashes. The locations of the 
mutations found in Wolfram syndrome patients, 3 missense, 
the premature stop codon (X) , 2 deletions, and the 7 bp 
repeat insertion, are indicated below the sequences. 
A predicted prenyltransf erase a-subtinit repeat structure 
(Ad93 to L502) is underlined. 

Figure 6. Co-segregation of WFSl mutations with 
the disease phenotype in Wolfram syndrome families. Gig. 
6a: The 1685del(N) in family WS-2, showing that each 
affected child (III-l, -2, -3, and -4) is homozygous for 
the mutation. Fig. 6b: Sequence chroraatograms of the 
region of exon 8 showing the 15 bp deletion in a patient 
homozygous for the mutation, along with a normal control. 
Fig. 6c: Segregation of the 1681C to T and the 
microscopic deletion in family WS-5. The 1681C to T 
mutation destroys a BsmFl site in a 766 bp PCR 
(polymerase chain reaction) fragment. The hemizygous 
T(-) affected children (II-l, -2, and -4) have only a 766 
bp uncut fragment, while the hemizygous C(-) mother (1-4) 
and unaffected daughter (II-3) have 686 bp and 80 bp 
bands, and the heterozygous CT father (1-3) has 766 bp, 
686 bp and 80 bp bands. The symbols and " + " refer to 

the absence or presence of enzyme. 

nSTAlUCD DESCRZPTION OF THE ZNVENTXON 

I - Definitions 

Various terms relating to the biological 
molecules of the present invention are used hereinabove 
and also throughout the specification and claims. 

With reference to nucleic acids molecules, the 
term "isolated nucleic acid" is sometimes used. This 
term, when applied to DNA, refers to a DNA molecule that 
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ie separated from sequences with which it is immediately 
contiguous (in the 5' and 3' directions) in the naturally 
occurring genome of the organism from which it was 
derived. For example, the "isolated nucleic acid" may 
comprise a DNA molecule inserted into a vector, such as a 
plasmid or virus vector, or integrated into the genomic 
DNA of a procaryote or eucaryote. An "isolated nucleic 
acid molecule" may also comprise a cDNA molecule. 

With respect to RNA molecules, the term 
"isolated nucleic acid" primarily refers to an RNA 
molecule encoded by an isolated DNA molecule as defined 
above. Alternatively, the term may refer to an RNA 
molecule that has been sufficiently separated from RNA 
molecules with which it would be associated in its 
natural state (i.e., in cells or tissues), such that it 
exists in a "substantially pure" form (the term 
"substantially pure" is defined below) . 

With respect to proteins or peptides, the term 
"isolated protein (or peptide)" or "isolated and purified 
protein (or peptide) " is sometimes used herein. This 
term refers primarily to a protein produced by expression 
of an isolated nucleic acid molecule of the invention. 
Alternatively, this term may refer to a protein which has 
been sufficiently separated from other proteins with 
which it would naturally be associated, so as to exist in 
"substantially pure" form. 

The term "substantially pure" refers to a 
preparation comprising at least 50-60% by weight the 
compound of interest . (e .g . , nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 
Purity is measured by methods appropriate for the 
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compoiind of interest (e.g. chromatographic methods, 
agarose or polyacryl amide gel electrophoresis, HPLC 
analysis, and the like) . 

Nucleic acid sequences and amino acid sequences 
can be compared using computer programs that align the 
similar sequences of the nucleic or amino acids thus 
define the differences. In the comparisons made in the 
present invention, the BLAST programs (NCBI) and 
parameters used therein were employed, and the DNAstar 
system (Madison, WI) was used to align sequence fragments 
of genomic DNA sequences. However, equivalent alignments 
and similarity/identity assessments can be obtained 
through the use of any standard alignment software. For 
instance, the GCG Wisconsin Package version 9.1, 
available from the Genetics Computer Group in Madison, 
Wisconsin, and the default parameters used (gap creation 
penalty=12, gap extension penal ty=4) by that program may 
also be used to compare sequence identity and similarity. 

The term "substantially the same" refers to 
nucleic acid or amino acid sequences having sequence 
variation that do not materially affect the nature of the 
protein (i.e. the structure, stability characteristics, 
substrate specificity and/or biological activity of the 
protein) . With particular reference to nucleic acid 
sequences, the term "substantially the same" is intended 
to refer to the coding region and to conserved sequences 
governing expression, and refers primarily to degenerate 
codons encoding the same amino acid, or alternate codons 
encoding conservative substitute amino acids in the 
encoded polypeptide. With reference to amino acid 
sequences, the term "substantially the same" refers 
generally to conservative substitutions and/or variations 
in regions of the polypeptide not involved in 
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determination of structure or function. 

The terms "percent identical" and "percent 
similar" are also used herein in comparisons among amino 
acid and nucleic acid sequences. When referring to amino 
acid secjuences, ^'percent identical" refers to the percent 
of the amino acids of the subject amino acid sequence 
that have been matched to identical amino acids in the 
compared amino acid sequence by a sequence analysis 
program. "Percent similar" refers to the percent of the 
amino acids of the subject amino acid sequence that have 
been matched to identical or conserved amino acids. 
Conserved amino acids are those which differ in structure 
but are similar in physical properties such that the 
exchange of one for another would not appreciably change 
the tertiary structure of the resulting protein. 
Conservative substitutions are defined in Taylor (1986, 
J. Theor. Biol. 119:205). When referring to nucleic acid 
molecules, '^percent identical" refers to the percent of 
the nucleotides of the subject nucleic acid sequence that 
have been matched to identical nucleotides by a sequence 
analysis program. 

With respect to antibodies, the term 
"immunologically specific" refers to antibodies that bind 
to one or more epitopes of a protein of interest, but 
which do not substantially recognize and bind other 
molecules in a sample containing a mixed population of 
antigenic biological molecules. 

With respect to oligonucleotides or other 
single- stranded nucleic acid molecules, the term 
"specifically hybridizing" refers to the association 
between two single- stranded nucleic acid molecules of 
sufficiently complementary sequence to permit such 
hybridization under pre -determined conditions generally 
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used in the art (sometimes termed "substantially 
complementary") . In particular, the term refers to 
hybridization of an oligonucleotide with a substantially 
complementary sequence contained within a single- stranded 
DNA or RNA molecule, to the substantial exclusion of 
hybridization of the oligonucleotide with single- stranded 
nucleic acids of non-complementary sequence. 

A '^coding sequence" or '"coding region" refers 
to a nucleic acid molecule having secjuence information 
necessary to produce a gene product, when the sequence is 
expressed. 

The term "operably linked" or *^operably 
inserted" means that the regulatory sequences necessary 
for expression of the coding sequence are placed in a 
nucleic acid molecule in the appropriate positions 
relative to the coding sequence so as to enable 
expression of the coding sequence. This same definition 
is sometimes applied to the arrangement other 
transcription control elements (e.g. enhancers) in an 
expression vector. 

Transcriptional and translational control 
sequences are DNA regulatory sequences, such as 
promoters, enhancers, polyadenylation signals, 
terminators, and the like, that provide for the 
expression of a coding sequence in a host cell . 

The terms ^promoter" , "promoter region" or 
"'promoter sequence" refer generally to transcriptional 
regulatory regions of a gene, which may be found at the 
5» or 3 • side of the coding region, or within the coding 
region, or within introns . Typically, a promoter is a 
DNA regulatory region capable of binding .RNA polymerase 
in a cell and initiating transcription of a downstream 
(3' direction) coding sequence. The typical 5' promoter 
sequence is bounded at its 3 • terminus by the 
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transcription initiation site and extends upstream (5' 
direction) to include the minimuTn number of bases or 
elements necessary to initiate transcription at levels 
detectable above background. Within the promoter 
sequence is a transcription initiation site (conveniently 
defined by mapping with nuclease SI) , as well as protein 
binding domains (consensus sequences) responsible for the 
binding of RNA polymerase. 

A "vector" is a replicon, such as plasmid, 
phage, cosmid, or virus to which another nucleic acid 
segment may be operably inserted so as to bring about the 
replication or expression of the segment. 

The term "nucleic acid construct" or "DNA 
construct* is sometimes used to refer to a coding 
sequence or sequences operably linked to appropriate 
regulatory sequences and inserted into a vector for 
transforming a cell . This term may be used 
interchangeably with the term trans forming DNA" . Such a 
nucleic acid construct may contain a coding sequence for 
a gene product of interest, along with a selectable 
marker gene and/or a reporter gene. 

The teirm "selectable marker gene" refers to a 
gene encoding a product that, when expressed, confers a 
selectable phenotype such as antibiotic resistance on a 
transformed cell . 

The term "reporter gene" refers to a gene that 
encodes a product which is easily detectable by standard 
methods, either directly or indirectly. 

A "heterologous" region of a nucleic acid 
construct is an identif ieible segment (or segments) of the 
nucleic acid molecule within a larger molecule that is 
not found in association with the larger molecule in 
nature. Thus, when the heterologous region encodes a 
mammalian gene, the gene will usually be flanked by DNA 
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that does not flank the mammalian genomic DNA in the 
genome of the source organism. In another exan^le, a 
heterologous region is a construct where the coding 
sequence itself is not found in nature (e.g,, a cDNA 
where the genomic coding sequence contains introns, or 
synthetic sequences having codons different than the 
native gene) . Allelic variations or naturally- occurring 
mutational events do not give rise to a heterologous 
region of DNA as defined herein. The term '^DNA 
construct", as defined above, is also used to refer to a 
heterologous region, particularly one constructed for use 
in transformation of a cell. 

A cell has been "transformed" or " transf ected" 
by exogenous or heterologous DNA when such DNA has been 
introduced inside the cell- The transforming DNA may or 
may not be integrated (covalently linked) into the genome 
of the cell. In prokaryotes, yeast, and mammalian cells 
for example, the transforming DNA may be maintained on an 
episomal element such as a plasmid. With respect to 
eukaryotic cells, a stably transformed cell is one in 
which the transforming DNA has become integrated into a 
chromosome so that it is inherited by daughter cells 
through chromosome replication. This stability is 
demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population 
of daughter cells containing the transforming DNA. A 
"clone" is a population of cells derived from a single 
cell or common ancestor by mitosis. A "cell line" is a 
clone of a primary cell that is capable of stable growth 
in vitro for many generations. 

Description 

In accordance with the present invention, a 
novel human gene, WFSl, has been isolated. The 
inheritance of a mutated P»^S1 gene is highly correlated 
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with the development of Wolfram Syndrome in the human 
population. Normal function of WFSl appears to be 
essential for survival of pancreatic islet (3 -cells and 
neurons. Included in the invention is the method for 
using the gene sequence to genetically screen for 
presence of the potentially mutated forms of the gene for 
diagnosis and prognosis of the disease in patients. 

The WFSl gene in humans spans 3 3.4 kb on 
chromosome 4p and is composed of 6 exons (Fig. 2B; SEQ ID 
NO:l) . The -5)cb of sequence upstream of the start of 
WFSl open reading frame is expected to contain one or 
more transcriptional or translational regulatory 
elements. The WFSl human cDNA is 3.688 kh long (SEQ ID 
NO: 2) and encodes a predicted protein 890 residues long 
with a predicted molecular mass of 100.29 kDa (SEQ ID 
NO: 3) . Comparison of the WFSl cDNA sequence with those 
in public databases found no related genes. A mouse cDNA 
of WFSl has also been isolated. The mouse cDNA (SEQ ID 
NO:4) is' 3511 nucleotides long and has 83.9% nucleotide 
identity to the human gene. The predicted protein 
sequence encoded by the mouse cDNA (SEQ ID NO: 5) has a 
86.1% amino acid similarity to the predicted human WFSl 
protein. 

The inventors have isolated the WFSl gene 
represented by SEQ ID N0:1 from the human genome by 
positional cloning. Previous work had isolated the 
Wolfram gene between markers D4S432 and D4S431 on 
chromosome 4p, 5.5 cM (-5500 kb) apart. In the 
development of this invention, five families with 
individuals having typical Wolfram syndrome phenotypes 
were genotyped with genetic markers shown to locate 
between D4S432 and D4S431 by physical and /or radiation 
hybrid mapping (see Example 1) . The region containing the 
Wolfram gene was thus narrowed further to a region 
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between D4S500 and D4S431, The critical region between 
D4S500 and D4S431 was estimated to be <250 kb as 
determined by contig mapping of BAC and PI genomic 
clones. Three clones were sequenced and the sequence of 
much of the contig region was determined. 

Exon trapping of two BAC clones was employed to 
generate expressed sequence tags (ESTs) of the region and 
then to determine areas with open reading frames that 
would be likely to contain a gene. Among a number of 
ESTs isolated was one predicted to be the 3 ' end 1.8 kb 
exon of a gene . A genomic fragment of this region 
hybridized to a 3.7 kb RNA on a Northern blot . The gene 
was determined to be expressed in all tissues, but 
surprisingly was most abundant in pancreatic islets 
compared to that in the exocrine pancreas . The abundance 
of WFSl RNA in pancreatic islets was particularly 
revealing because one of the outcomes Wolfram Syndrome is 
atrophy of pancreatic islets. 

A full length cDNA clone of WFSl was obtained 
by screening a human infant brain cDNA library (SEQ ID 
N0:2) . This clone was 3,688 nucleotides long and 
contained an appropriate start methionine, open reading 
frame, and polyadenylation signal. Comparison of the 
cDNA sequence of WFSl with those in public databases 
revealed no related genes. Translation of the cDNA 
sequence predicts a polypeptide of 890 amino acid 
residues with a molecular mass of 100.29 kDa. The 
protein is distinguished grossly by the presence of 3 
structural domains, a hydrophilic N- terminal region of 
-300 residues, a hydrophilic C- terminal region of -240 
residues, and a central hydrophobic core of -350 
residues. Inspection of the hydrophobicity curve 
suggests the presence of -10 transmembrane segments. 

When patients with Wolfram Syndrome were 
checked for mutagenesis in the WFSl gene, seven mutations 
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were found in the full-length clones derived from the 
original EST. PGR (polymerase chain reaction) was used 
to amplify exons of WFSl from subjects with Wolfram 
Syndrome, and the products were sequenced. Comparison of 
these sequences with the wild type gene revealed probable 
loss of function mutations in all cases, as described in 
greater detail below. 

The following description set forth the general 
procedures involved in practicing the present invent ion - 
To the extent that specific materials are mentioned, it 
is merely for purposes of illustration and is not 
intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 
forth in Sambrook et al . , Molecular Cloning , Cold Spring 
Harbor Laboratory (1989) (herein *»Sambrook et al.") or 
Ausubel et al . (eds) Current Protocols in Molesul^ 
Biology . John Wiley & Sons (1999) (herein ^Ausubel et 
al.*) are used. Unless otherwise specified, general 
genome analysis procedures were used, such as set forth 
in Genome Analysis ; h Laboratory Mannaif Cold Spring 
Harbor Laboratory Press (1997) . 

!• Preparation of WFSl nucleic acid molecules , encoded 
proteins and imsamo logically specific antibodies 

A. Nucleic Acid Molecules 

Nucleic acid molecules comprising part or all 
of the WFSl gene of the invention may be prepared by two 
general methods: (1) they may be synthesized from 
appropriate nucleotide triphosphates, or (2) they may be 
isolated from biological sources. Both methods utilize 
protocols well known in the art. 

The availability of nucleotide sequence 
information, such as Sequence I.D. Nos . 1, 2 and 4, 
enables preparation of an isolated nucleic acid molecule 
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of the invention by oligonucleotide synthesis. Synthetic 
oligonucleotides may be prepared by the phosphor amadite 
method employed in the Applied Biosystems 38A DNA 
Synthesizer or similar devices. The resultant construct 
may be purified by high performance liquid chromatography 
(HPIiC) . Long, double -stranded polynucleotides, such as a 
DNA molecule of the present invention, must be 
synthesized in stages, due to the size limitations 
inherent in current oligonucleotide synthetic methods. 
Thus, for example, a double -stranded DNA molecule several 
Jcilobases in length may be synthesized as multiple 
smaller segments of appropriate complementarily . 
Complementary segments thus produced may be annealed such 
that each segment possesses appropriate cohesive termini 
for attachment of an adjacent segment. Adjacent segments 
may be ligated by annealing cohesive termini in the 
presence of DNA ligase to construct an entire double- 
stranded molecule. A synthetic DNA molecule so 
constructed may then be cloned and amplified in an 
appropriate vector. 

WFSl nucleic acid sequences may be isolated 
from appropriate biological sources using methods known 
in the art. In a preferred embodiment, a human genomic 
clone is isolated from a human genomic PI library. In 
another preferred embodiment, a cDNA clone is isolated 
from the Marathon -Ready human fetal brain X gtlO cDNA 
library (Clontech) . In yet another preferred embodiment, 
a mouse cDNA is isolated from a mouse pancreatic 3-cell 
line (MIN6) cDNA library. The isolation of human and 
mouse clones is not limited to the aforementioned 
libraries, and other commercially avail aJDle human and 
mouse libraries may be used. Alternatively, cDNA or 
genomic clones from other species may be obtained. 

In accordance with the present invention, 
nucleic acids having the appropriate sequence homology 
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with part or all of Sequence I.D. Nos . 1, 2 or 4 may be 
identified by using hybridization and washing condition 
of appropriate stringency. For example, hybridizations 
may be performed, according to the method of Sambrook et 
al., using a hybridization solution comprising: 5 x SSC, 
5 X Denhardt's reagent, 1.0% SDS, 100 ^g/rcH denatured, 
fragmented salmon sperm DNA, 0.05% sodium pyrophosphate 
and up to 50% formamide. Hybridization is carried out at 
37-420C for at least six hour. Following hybridization, 
filters are washed as follows: (1) 5 minutes at room 
tenperature in 2 x SSC and 1% SDS; (2) 15 minutes at room 
temperature in 2 x SSC and 0.1% SDS; (3) 30 minutes -1 
hour at 37oC in l x SSC and 1% SDS; (4) 2 hours at 42- 
65oin 1 X SSC and 1% SDS, changing the solution every 3 0 
minutes. In a preferred embodiment, hybridizations are 
performed in hybridization solution comprising 0. S M 
NaPO*, 2 mM EDTA, 7% SDS and 0.1% sodium pyrophosphate (pH 
7.1) at about 65oC for 20 hours. For high- stringency 
conditions, membranes are subsequently washed 
sequentially for 1 hour each in: (1) 2X SSC, 0.5X SET, 
0.1% sodium pyrophosphate; and (2) O.IX SSC, 0.5X SET, 
0.1% sodium pyrophosphate. For low- stringency 
conditions, membranes are washed at 50oC for 3 0 minutes 
in 2X SSC, 0.5X SET, 0.1% sodium pyrophosphate. 

One common formula for calculating the 
stringency conditions rec[uired to achieve hybridization 
between nucleic acid molecules of a specified sequence 
homology (Sambrook et al . , 1989): 

T„ = 81.5*'C + 16.6Log [Na+] + 0.41(% G+C) - 0.63 (% formamide) - 600rtWjp in duplex 

As an illustration of the above formula, using [N+] = 
[0.368] and 50% formamide, with GC content of 42% and an 
average probe size of 200 bases, the T„ is 57*'C. The T„ 
of a DNA duplex decreases by 1 - 1.5**C with every 1% 
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decrease in homology. Thus, targets with greater than 
about 75% sequence identity would be observed using a 
hybridization temperature of 42 'C. 

The stringency of the hybridization and wash 
depend primarily on the salt concentration and 
temperature of the solutions. In general, to maximize 
the rate of annealing of the probe with its target, the 
hybridization is usually carried out at salt and 
temperature conditions that are 20-25**C below the 
calculated T„ of the of the hybrid. Wash conditions 
should be as stringent as possible for the degree of 
identity of the probe for the target. In general, wash 
conditions are selected to be approximately 12-2 0*0 below 
the T„ of the hybrid. In regards to the nucleic acids of 
the current invention, a moderate stringency 
hybridization is defined as hybridization in 6X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 /ig/ml denatured 
salmon sperm DNA. at 42**C, and wash in 2X SSC and 0.5% SDS 
at 55°C for 15 minutes, A high stringency hybridization 
is defined as hybridization in 6X SSC, 5X Denhardt's 
solution, 0.5% SDS and 100 /^g/ml denatured salmon sperm 
DNA at 42*'C, and wash in IX SSC and 0.5% SDS at 65°C for 
15 minutes. A very high stringency hybridization is 
defined as hybridization in 6X SSC, 5X Denhardt's 
solution, 0.5% SDS and 100 //g/ml denatured salmon sperm 
DNA at 42^*0, and wash in 0 . IX SSC and 0.5% SDS at es^'C 
for 15 minutes. 

Nucleic acids of the present invention may be 
maintained as DNA in any convenient cloning vector. In a 
preferred embodiment, genomic clones are maintained in a 
COS -7 cells in the vector pSPL3 (Life Technologies, 
Inc.) . In another preferred embodiment, PCR products are 
subcloned into pAMPlO using the UDG cloning kit (GIBCO 
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BRL) , and propagated in a suitable E. coli host cell. In 
another preferred embodiment, clones are maintained in 
plasmid cloning /express ion vector, such as pGEMT 
(PROMEGA) , and propagated in £. coli. 

WFSl nucleic acid molecules of the invention 
(including those containing known polymorphisms and 
mutations) include cDNA, genomic DNA, RNA, and fragments 
thereof which may be single- or double- stranded. Thus, 
this invention provides oligonucleotides (sense or 
ant i sense strands of DNA or RNA) having sequences capable 
of hybridizing with at least one sequence of a nucleic 
acid molecule of the present invention, such as selected 
segments of Sequence I.D. Nos. 1, 2 and 4. Such 
oligonucleotides are useful as probes for detecting WFSl 
genes (and specific mutations) in test samples, e.g. by 
PCR amplification, or as potential regulators of gene 
expression. 

B. Proteins and Antibodies 

A full-length WFSl -encoded protein of the 
present invention may be prepared in a variety of ways, 
according to )cnown methods. The protein may be purified 
from appropriate sources, e.g., human or animal cultured 
cells or tissues, by immunoaf f inity purification. 
However, due to the limited amount of such a protein that 
may be present in a sample at any given time, 
particularly in tumors or tumor cell lines, conventional 
purification techniques are not preferred in the present 
invention. 

The availability of the isolated WFSl coding 
sequence enables production of protein using in vitro 
expression methods known in the art. For example, a cDNA 
or gene may be cloned into an appropriate in vitro 
transcription vector, such a pSP64 or pSP65 for in vitro 
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transcription, followed by cell-free translation in a 
suitable cell -free translation system, such as wheat germ 
or rabbit reticulocytes. In vitro transcription and 
translation systems are contmercially available, e.g., 
from Promega Biotech, Madison, Wisconsin or BRL, 
Rockville, Maryland. 

Alternatively, the recombinant protein may be 
produced by expression in a suitable procaryotic or 
eukaryotic system. For example, part or all of a DNA 
molecule, such as the cDNA having SEQ ID NO:2 or No. 4, 
may be inserted into a plasmid vector adapted for 
expression in a bacterial cell, such as E. coli, or into 
a baculovirus vector for expression in an insect cell . 
Such vectors comprise the regulatory elements necessary 
for expression of the DNA in the bacterial host cell, 
positioned in such a manner as to permit expression of 
the DNA in the host cell. Such regulatory elements 
required for expression include promoter sequences, 
transcription initiation sequences and, optionally, 
enhancer sequences. 

The protein produced by WFSl gene expression in 
a recombinant procaryotic or eukaryotic system may be 
purified according to methods known in the art . A 
commercially available expression/secretion system can be 
used, whereby the recombinant protein is expressed and 
thereafter secreted from the host cell, to be easily 
purified from the surrounding medium. If 

expression/secretion vectors are not used, an alternative 
approach involves purifying the recombinant protein by 
affinity separation, such as by immunological interaction 
with antibodies that bind specifically to the recombinant 
protein. Such methods are commonly used by skilled 
practitioners. 

Proteins prepared by the aforementioned methods 
may be analyzed according to standard procedures. For 
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example^ such proteins may be subjected to amino acid 
sequence analysis, according to known methods. 

Included in the present invention are 
antibodies capaUsle o£ immunospecifically binding to 
proteins of the invention. Polyclonal antibodies 
directed toward WFSl- encoded proteins may be prepared 
according to standard methods. Monoclonal antibodies may 
be prepared, which react immunospecifically with various 
epitopes of the proteins . Monoclonal antibodies may be 
prepared according to general methods of K6hler and 
Milstein, following standard protocols. Polyclonal or 
monoclonal antibodies that immunospecifically interact 
with P^F51 -encoded proteins can be utilized for 
identifying and purifying such proteins. For example, 
antibodies may be utilized for affinity separation of 
proteins" with which they immunospecifically interact. 
Antibodies may also be used to immunoprecipitate proteins 
from a sample containing a mixture of proteins and other 
biological molecules. Other uses of antibodies are 
described below. 

IX. Uses o£ WFSl Mucleic Acids, Encoded Proteins 
and Izmnuno logically Specific Antibodies 

A. WFSl Nucleic Acids 

Nucleic acids comprising part or all of the 
WFSl gene may be used for a variety of pxirposes in 
accordance with the present invention. As illustrated in 
Example 1, selected WFSl sequences {DNA, RNA or fragments 
thereof) may be used as probes to identify mutations or 
rearrangements in a patient's DNA, and/or monitor the 
level of WFSl transcripts in tissues. As discussed 
earlier, WPSI mutations are associated with the 
occurrence of Wolfram Syndrome, a disease of autosomal 
recessive inheritance. Early identification of patients 
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deatined to develop Wolfram Syndrome may lead to 
preventive therapies. Identification of heterozygous 
individuals that are at risk of having a child with 
Wolfram Syndrome will be very useful in genetic 
counseling. 

WFSl sequences may be utilized as probes in a 
variety of assays known in the art, including but not 
limited to: (1) in situ hybridization; (2) Southern 
hybridization; (3) northern hybridization; and (4) 
assorted anplif ication reactions, such as polymerase 
chain reaction (PGR) . In a preferred embodiment, large 
deletion and premature termination mutations are detected 
by separation on acryl amide or agarose gel 
electrophoresis and Southern blotting with probes made 
from WFSl gene sequences. Knowledge of the wildtype 
sequence allows the identification of point mutations in 
non- functional WFSl genes. In another preferred 
embodiment, mutated genes are differentiated from 
wildtype genes by using restriction enzyme sites that 
appear or disappear as the result of the mutation. WFSl 
nucleic acids are digested and the fragments are then 
separated and probed as described above . 

Both of the above-mentioned preferred 
embodiments are illustrated in Example 1 . The humaji 
genomic WFSl sequence was used to design screening 
procedures to quickly screen all the individuals in a 
Wolfram Syndrome patient's extended family. In the case 
of large deletion mutations, the screen entailed 
amplifying the region of the gene encompassing the 
deletion, then separating the products by agarose gel 
electrophoresis and performing Southern blot 
hybridization using a labelled wild type PGR fragment. 
In the case of point mutations, the PGR primers were 
designed as above, the mutant and wildtype products were 
digested with a restriction enzyme that was specific to 
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either the sequence at the point mutation or the 
corresponding wildtype sequence. The products of the 
digestion, along with undigested control nucleic acids, 
were separated and detected as eCbove. In all of the six 
extended families studied, inheritance of a homozygous 
complement of a mutated WFSl gene was consistent with the 
development of the disease and the pedigree of the 
family. Of particular interest are the three mis -sense 
mutations in the human WFSl predicted polypeptide, whose 
wild type sequence is found to be conserved in the 
predicted mouse polypeptide. This sequence conservation, 
together with the mutational effect of non- conservative 
amino acid substitutions at these sites, suggests that 
these amino acids are critical for gene function. A 
primary screen of these sequences is a useful way to 
expedite the identification of mutations. Other critical 
mis-sense mutations may also be useful in connection with 
this invention. These other mutations can be found using 
procedures detailed in Example 1 and others well known in 
the art. 

The WFSl nucleic acids of the invention may 
also be utilized as probes to identify related genes 
either from humans or from other species. In a preferred 
embodiment, a cDNA has been isolated from mouse 
insulinoma cDNA library by standard screening methods and 
RACE PCR. The mouse cDNA was 3,511 nucleotides long with 
83 . 9% nucleotide identity to the coding sequence of the 
human gene and 86.1% predicted amino acid similarity. 
While the cloning of the mouse cDNA is illustrated in 
Example 1, those skilled in the art will appreciate that 
the isolation of WFSl from other species is by no means 
limited to mouse. Other mammalian species of interest 
include, but are not limited to, cow, cat, dog, horse, 
pig and rat. As is well known in the art, hybridization 
stringency may be adjusted so as to allow hybridization 
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of nucleic acid probes with complementary sequencing of 
viarying degrees of homology. 

The cDNA from the mouse wfsl gene is very 
useful because it allows further study of the gene and 
Wolfram Syndrome in system more conducive to research 
than human. The mouse clone may be used to create a 
targeting construct, which can be used for the targeted 
mutagenesis of the endogenous mouse gene. By creating 
both null and site-directed mutants, a mouse model system 
for Wolfram Syndrome ccin be generated. This mouse system 
can subsequently be used to elucidate the cellular 
function of the Wolfram gene product, as well as assist 
in developing therapies for the syndrome. Systems other 
than mouse can also be used to advantage. These systems 
include, but are not limited to animal models developed 
in mouse, various cultured human and mammalian cell 
systems (e.g., mouse and rat insulinoma cells) and frog 
oocyte expression systems , 

As described above, the coding region of WFSl 
may also used to advantage to produce substantially pure 
WFSl encoded proteins or selected portions thereof. As 
described below, these proteins may also be used in 
diagnosis and therapy of Wolfram Syndrome. 

B. Proteins and Antibodies 

The WF51- encoded protein, or fragments thereof, 
may be used to produce polyclonal or monoclonal 
antibodies, which also may serve as sensitive detection 
reagents for the presence and accumulation of the WFSl- 
encoded polypeptide in cultured cells or tissues from 
living patients (the term "patient" refers to both humans 
and animals) . Because the WFSl -encoded protein has not 
yet been isolated from natural sources, such antibodies 
will greatly accelerate the identification, isolation and 
characterization of this protein in mammalian cells and 
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tissues. Recombinant techniques enable expression of 
fusion proteins containing part or all of the WFSl- 
encoded protein. The full-length protein or fragments of 
the protein may be used to advantage to generate an array 
of monoclonal antibodies specific for various epitopes of 
the protein, thereby potentially providing even greater 
sensitivity for detection of the protein in cells or 
tissues. Monoclonal antibodies specific to variant 
portions of the WFSl polypeptide may also be used to 
advantage in diagnosing presence of a variant form of the 
gene . 

Polyclonal or monoclonal antibodies 
immunologically specific for the WFSI- encoded protein may 
be used in a variety of assays designed to localized 
and/or quantitate the protein. Such assays include, but 
are not limited to: (1) flow cytometric analysis; (2) 
immunochemical localization of the protein in cultured 
cells or tissues; and {3) immunoblot analysis (e.g., dot 
blot. Western blot) of extracts from cells and tissues. 
Additionally, as described above, such antibodies can be 
used for the purification of WFSI-encoded proteins (e.g., 
affinity column purification, immunoprecipitation) . 

The following example is provided to describe 
the invention in greater detail . It is intended to 
illustrate, not to limit, the invention. 

EXAMPLE 1 
Isolation and Identification of 
Wolfram Gene euxd Mutants 

METHODS 

Patients and families. Three families 
originated from Japan: WS-1 (Nanko et al . , Brit J 
Psychiatry Hi: 282, 1992), WS-2 (Higashi, Am J Otology 
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12: 57-60, X991, and WS-3 (Maruta et al . , Clin Neurol 27: 
725,732, 1987). Cell lines for the Caucasian family WS-4 
were obtained from the NIGMS Human Genetic Mutant Cell 
Repository, Camden, NJ.. {family #1157) . Family WS-5 is 
Caucasian Australian and WS-6 Saudi Arabian. Minimum 
criteria for diagnosis were young-onset insulin -dependent 
diabetes mellitus and progressive optic atrophy. 

Microsatellite cuaalysls. Initial genotyping 
was carried out as described previously (Nestorowicz et 
al., Hum Molec Genet 7: 1119-1128, 1998; Inoue et al . , 
Diabetes 45: 789-794, 1996) with markers reported in the 
1996 Genethon Microsatellite Map {D4S127, Hox7 , D4S412, 
D4S3023, D4S2925, D4S431, D4S2935, D4S3007 , D4S394, 
D4S2983, D4S2923) . Second genotyping was performed with 
markers {D4S2957, D4S2375, A348XA5, D4SB27, D4S500r and 
D4S2366) , shown to locate in the interval of D4S3023' 
D4S431 by physical and radiation hybrid mapping projects 
of chromosome 4. The STANFORD CHR 4 YAC MAP project and 
the CHROMOSOME 4 SUMMARY MAP were viewed at WEB sites 
(http : //shgc . Stanford . edu/ , 

http://cedar.genetics.soton.ac.uk/). Primer sequences 
were obtained from the Genome Database (GDB - 
ht tp : / /www . hgmp . mrc . ac . ok/ gdb/gdbtop . html ) . 

Pl/BAC library screezilng. A human genomic PI 
library (HD-K) was screened for clones containing D4S500 
and D4S431 by PCR, using primers developed to 
specifically hybridize with those markers. Sequence- 
tagged sites (STSs) (SP6 and T7) from Pis 102C5, 89C1 and 
77B6, as well as D4S500 and D4S431, were used for BAC 
library screening by either PCR or by direct 
hybridization of library grid blots (Research Genetics, 
Inc., Huntsville, Alabama). 

£xon- trapping, sample and shotgun megabase 
gencamic sequencing. Restriction fragments from BAC 460K9 
and 33H22 were cloned into the BamHI site of pSPL3 , 
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transfected into COS -7 cells and spliced products 
obtained by RT-PCR were subcloned into pAMPlO using the 
UDG cloning kit (GIBCO BRL) and sequenced. 

Sample and shotgun sequencing of BAC4 60K9 and 
3 3H22 was accomplished as described (Wilson and Mardis, 
in Genome Analysis: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, New York, NY, 1997) . Similarity 
searches with known genes and ESTs were performed using 
BIAST programs (NCBI) . The sequencing project was 
compiled using LaserGene software (DNAStar, Madison, WI) . 
Sequencing was performed on an ABI 373 v^ith Prism dye 
terminator kits. 

Northern blot analysis, cDNA screening, and 5' 
RACE analysis. A multiple adult human tissue polyA+ RNA 
Northern blot (Clontech- MTN human I) was probed with an 
854 -bp- fragment (nt 2133-2986 WFSl cDNA) as described by 
the manufacturer. Total cellular RNA was isolated 
(Chomczynski and Sacchi, Anal Biochem 162 : 156-159, 1987) 
and Northern analysis performed with 20 fig of RNA by 
hybridization with WFSl cDNA labeled with [a-^^P] dCTP as 
previously described (Ferrer et al . , Diabetes 386- 
392, 1997) . RNA quality and loading was checked by 
staining the gel for ribosomal RNA and by hybridization 
with either p- act in or ribosomal RNA. 

The same 854 -bp fragment was used for screening 
an infant brain X gtlO cDNA library. Six overlapping 
clones containing the WFSl gene were isolated. 5' RACE 
analysis was performed using Marathon- Ready human fetal 
brain cDNA (Clontech, Palo Alto, CA) . The RACE products 
were subcloned into pGEMT vector (Promega Biotech, 
Madison WI) and sequenced. A mouse pancreatic ^-cell 
line (MIN6) cDNA library was screened with human WFSl 
cDNA using standard reduced stringency conditions, and a 
3.0 kb clone was isolated. A primer corresponding to the 
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predicted 5' -untranslated region, based on the mouse EST 
sequence (GenBank AA021827 and AA692227) that was 
homologous to the human cDNA, was synthesized and used 
for RT-PCR with MIN6 RHA ( 5 ' - CGGTTTCGGAGCAACTTCGC - 3 ' , 
SEQ ID NO: 42 and 5'- CACCTCAGCCTCGTTCTCAG -3', SEQ ID 
NO:43) . 

Mutation detection. M13 universal primer 
sequence was incorporated into the 5' terminus of primers 
for direct sequence analysis using an ABI automated DNA 
sequencer Model 373 (Chadwick et al . , Biotechniques 20: 
676-683, 1996). Primers used for exon amplification, 
genomic sequencing and mutation detection are as in Table 
2 . 

The 2812del(TC) was detected with a labeled PCR 
fragment on a denaturing Long Ranger pol y aery 1 amide gel 
(FMC Bioproducts) . The 1685del (N^g) was detected by 
electrophoresis of PCR fragments on 4% agarose gels. The 
234 IC to T was detected by PCR, digestion with EcoNI (New 
England Biolabs (NEB) ) emd polyacryl amide -gel 
electrophoresis. The mutated allele was observed as 209 
bp and 3 5 bp fragments. 

The 2254G to T mutation was detected by PCR, 
digestion with Avail (NEB) and agarose gel 
electrophoresis, with two bands (240 bp, 137 bp) in the 
wild allele, and three (24 0 bp, 12 0 bp, 17 bp) in the 
mutant. The 2114G to A mutation was detected by PCR, 
digestion with Tfi I (NEB) and agarose gel 
electrophoresis. The wild allele has one band of 528 bp, 
and the mutated allele has two bands (396 bp, 132 bp) . 
The 168 IC to T mutation destroys a BsmFl restriction 
site, and primers are set 8a (Table 2) , with resulting 
fragments described in Fig. 6c. 

GenBank Accession Numbers. The human WFSl and 
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mouse wfsl cDNA sequences are deposited in GenBank: 
#AF084481 and #AF084482 . 

RESULTS 

Linkage Analysis. Linkage studies were 
conducted on three Japanese families (WS-1, -2, and -3) 
and two Caucasian families (WS-4, -5) (Figs. lA-lE) , each 
with at least two individuals having typical Wolfram 
syndrome phenotypes. When genotyping with chromosome 4p 
markers used previously (Polymeropoulos et al . , Nature 
Genetics £: 95-97, 1994; Collier et al . , Am J Hum Genet 
59 ; 855-863, 1996), estimates for recombination fractions 
(6) between WFS and the markers confirmed close linkage 
(lod =3.99 for D4S431 at 6 =0.05). Multipoint analysis 
with GENEHUNTER (version 1.1) (Kruglyak et al . , Am J Hum 
Genet S9: 1347-1363, 1996) gave a lod->6.0 for the region 
encompassed by markers D4S827 and D4S394. 

Haplotype Axxalysis and Happing by 
Recombination. Haplotypes were constructed by 
inspection. The boundaries for the WFS gene had been 
defined by a telomeric recombinant at D4S432 (Collier et 
al., 1996), and centromeric recombinants at D4S431 
(Polymeropoulos et al . , 1994; Collier et al . , 1996). 
Recombinants in families WS-1 - WS-6 were identified by 
genotyping with genetic markers shown to locate between 
D4S432 and D4S431 by physical and/ or radiation hybrid 
mapping. In family WS-1, subject III-2, recombination of 
the telomeric region from D4S827 was observed (Fig. lA) . 
In the Japanese family WS-2 (Fig. IB), all affected 
subjects (III-l, -2, -3, and -4) were haploidentical 
centromeric to D4S500, suggesting the presence of a 
historical recombinant in the unrelated father (II -5) . 
The centromeric boundary was confirmed as D4S431 by a 
recombinant in WS-5, subject III-2 (Fig. IE) . We 
concluded that the WFS gene likely lies within the 
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interval D4S500'D4S431 . 

Construction o£ a genomic contlg across the WFS 
critical region. In the STANFORD CHR 4 YAC MAP project, 
D4S827 and D4S431 are within two overlapping YACs 
encompassing the region (204H9 and 420E4) . Further, 
D4S500 and D4S431 were located on the same YAC (420E4> . 
A contig was constructed with genomic human Pl/BAC 
clones, and a contiguous genomic map encompassing the WFS 
gene was obtained (Figure 2) . STS content mapping 
confirmed overlapping clones. As the interval between 
D4S500 and D4S431 was covered by a PI (102C5) and a BAG 
clone (33H22) , the critical region was estimated to be 
<250 kb. 

Identification o£ candidate genes witliln the 
WFS region, and cloning the WFSl gene. By exon trapping 
of BAGS 33H22 and 460K9, an expressed sequence tag (EST) 
was found that resulted in the cloning of the y-isoform 
of the B regulatory subunit of the human protein 
phosphatase 2A (PP2ABRy) . The genomic structure of 
PP2ABRY was determined, and direct secjuence analysis of 
probands excluded this gene. In parallel, we initiated 
large-scale genomic sequencing from BACs 460K9 and 33H22, 
and PI 102C5. A total of -180 kb of sequence was 
analyzed. Among a number of EST matches, one was 
predicted to be the 3 ' -end exon containing a 1.8 kb open 
reading frame (exon 8, Fig. 2B) . This EST was further 
evaluated as it was unambiguously within the critical 
region. 

The size of the full-length mRNA and pattern of 
tissue expression was determined by PGR amplification of 
an 854 -bp genomic fragment of the region that was 
hybridized to a multiple -tissue Northern blot with polyA+ 
RNA. A major transcript of -3.7 kb was expressed in all 
tissues including pancreas (Fig. 3A) . Northern analysis 
of total RNA (20 ^g) revealed the gene most abundantly 
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expressed in pancreatic islets compared to that in 
exocrine pancreas (Fig. 3C> , 

A full-length clone was obtained by screening a 
human infant brain cDNA library. Six clones were 
isolated and subsequently 5' RACE analysis was performed. 
These analyses yielded a con^osite cDNA sequence of 3.688 
kb. The longest open reading frame extended from nt 171 
to 2843. The methionine at position 171 was chosen as 
the translation initiation codon primarily because it 
conforms to Kozak's rule (Kozak, Mamm Genome X' 563-574, 
1996) . A consensus polyadenylation site (aataaa) was 
located at position 3615-20, 19 bases upstream from the 
polyA tail. The gene was named WFSl. 

Predicted characteristics of the WFSl protein, 
and cloning o£ the mouse cDMA« Comparison of the cDNA 
- sequence of WFSl with those in public databases revealed 
no related genes. Translation of the cDNA sequence 
predicts a polypeptide of 890 amino acid residues with a 
molecular mass of 100.29 kDa, Hydrophobicity analysis of 
the deduced amino acid sequence is presented in Figure 4 . 
The protein is distinguished grossly by the presence of 3 
structural domains, a hydrophilic N-terminal region of 
-3 0O residues, a hydrophilic C- terminal region of -24 0 
residues, and a central hydrophobic core of -3 50 
residues. Inspection of the hydrophobicity curve 
suggests the presence of -10 transmembrane segments, if 
it is assumed that this region of the protein consists of 
a-helical segments . Comparison of the predicted amino 
acid sequence with entries in the Prosite database 
produced a single match to the prenyl transferase a- 
siibunit repeat structure. 

A mouse wfsX cDNA was isolated from a mouse 
insulinoma (MIN6) (Ishihara et al., DieOaetologia 1^: 
1139-1145) cDNA library, and completed by RT-PCR. The 
mouse wfal cDNA was 3511 nucleotides with 83.9% 
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nucleotide identity to the coding sequence of the human 
gene, and 86.1% amino acid similarity (Figure 5). 

Genomic structure of the WFSl gene and 
mutations in WFS patients. The genomic structure of WFSl 
was determined by comparison of cDNA and genomic 
sequences obtained by shotgun sequencing of BAC4 60K9 and 
33H22 and sequences in the Stanford Humsm Genome Center 
database (http://www.shgc.stanford.edu). The gene was 
found to be composed of eight exons (Pig. 2B} in 33.4 kb 
of genomic DNA. 

For mutation screening, exons were amplified 
and sequenced from patients' genomic DNA. A TC deletion 
at position 2812 for subject WS-1 III-2 predicted a 
frameshift at codon 882, designated del882f s/ter937 
(Table 1) , with absence of the normal stop codon at 891 
cind the introduction of a new downstream termination 
codon. The predicted WFSl protein contains 937 amino 
acids, 47 more than the normal protein. All 3 affected 
sibs (WS-1 III-l, -2, and -4) were homozygous for this 
mutation, while the unaffected sib and the parents were 
heterozygous, indicating a disease- specif ic mutation. 
The 2812delTC mutation was not found in 80 healthy 
control Japanese subjects (160 chromosomes, see Table 1) . 

In other WFS families, six additional mutations 
were found in exon 8 (TcO^le 1). In family WS-2, affected 
offspring (III-l, -2, -3, and -4) inherited a 15 bp 
deletion resulting in delSOSYVYLL, homozygous by descent 
from related heterozygous parents. Co -segregation of 
this deletion with the WFS phenotype is shown in Fig 6A. 
A sequence chromatogram from an affected child homozygous 
for the 15 bp deletion is shown in Fig. 6B. In family 
WS-3, both affected offspring (II-l and -2) were 
homozygous for a 2341 C to T transversion resulting in a 
P724L mutation. In the Caucasian family WS-4, all 
affected offspring <II-1, -3, and -4) were found to be 
compound hetero zygotes for a 2254 G to T transition 
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resulting in a G695V (paternal) mutation, and a 2114 G to 
A transversion resulting in a W64 8X (maternal) mutation. 
The W64 8X mutation predicts a premature termination, and 
lack of 242 amino acids of the C- terminus. In each of 
these 4 families the mutations were shown to co-segregate 
with the disease phenotype, both by sequencing and by 
either size change, or alteration of a restriction 
endonuclease site. None of the mutations were foiind in 
Japanese or Caucasian control subjects. No other coding 
variants were found on sequencing the entire gene in each 
proband . 

In the Australian family WS-5, a 1681C to T 
transversion (P504L) was observed. This mutation 
destroys a BsmFl restriction site. In Fig. 6C, the 
father (1-3) is shown to be heterozygous for 168 IC to T, 
and the mother (1-4) is homozygous 168 IC. Yet 
surprisingly all affected offspring (II-l, -2, and -4) 
appeared to be homozygous 1681T. The unaffected child 
(II -3) appeared to be homozygous for 1681C. The most 
likely explanation for these findings is that the 
mother's chromosome, inherited by each child (see 
haplotypes in Fig. IE) , harbored a microscopic deletion 
for the WFSl gene, and that the affected offspring were 
hemizygous for the P504Ij mutation. 

A sixth family (WS-6) with one 10 year old 
affected child and two apparently unaffected younger 
sisters became available for analysis. The parents were 
Saudi Arabian first cousins. The affected child was 
homozygous for a 7bp repeat insertion at 1610 (CTGAAGG) , 
resulting in a predicted frame shift and premature 
termination of the protein at codon 544 , The parents 
were heterozygous, while the unaffected sisters were 
heterozygous and homozygous normal respectively. 
Sequence analysis also revealed a number of silent and 
intronic variants (polymorphisms) in various families 
(see Table 1) . 
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Tablo 1, Mutations and polymorphisms in WFSl 



Mutation 



Amino acid 



Bxon/incron 



Family 



10 



15 



20 



2ai2del (TC) 
1685del (CCTGCT 
CTATGTCTA) 
2341C to T 
2254G to T 
2114G to A 
1681 C to T 
del WFSl'' 
l€10insCTGAAGG 



del882£8/ter937 Exon S 

delSOBYVYIOi Bxon 6 

P724L Bxon B 

a€95V Bxon 8 

W648X Bxon 8 

P504L Excn 8 

ins4B3£s/ter544 Excn 8 



WS-1 
WS-2 

HS-3 
WS-4 
WS-4 
WS-5 
WS-S 
WS-6 



Control 
cluromo sosM 



160* 
ISO ' 

160 • 

160>» 
160*' 
160** 
ND 
160** 



Cbangas of 
uncertain 

ef£ect- 

11670 to A X333V Exon 8 WS-4 {same chr. ND 

as W648X } 



Polymorphisms 



35 



8540 to C 


D26eD 


Exon 


6 


WS-4 


MD 


1355T 


to 


c 


V39SV 


Exon 


8 


WS-4 


ND 


1457C 


to 


T 


C429C 


Bxon 


8 


WS-3 


ND 


1537A 


to 


G 


R456H 


Exon 


B 


WS-1, -2, -3,-4, 
normal Q 


ND 


1545C 


to 


T 


L459L 


Bxon 


8 


WS-3 


ND 


1570T 


to 


C 


L507L 


Bxon 


8 


WS-4 


ND 


leisc 


to 


T 


L549L 


Bxon 


B 


WS-4 


ND 


1925C 


to 


T 


F585F 


Bxon 


8 


WS-4 


ND 


2002G 


to 


A 


H611R 


Bxon 


8 


WS-1, -2, -3, 
normals 


ND 


2603A 


to 


G 


K811K 


Bxon 


8 


WS-4 


ND 


2735G 


to 


A 


sasss 


Bxon 


8 


WS-4 


ND 


i032-5C to G 




Intrcn 7 


WS-1, -2, -3,-4 


ND 



a-Japanese 
b- Caucasian 

50 

c-Not confirmed 
d-Palestinian Arabs 
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TabXe 2. Primers for amplification, sequencing and 
mutation detection (SEQ ID NOS: 6-41, consecutively) 

Prlxaors 



Product 

SlSQ 



1* 

Sxon 2 
3 



5 

Bxon 6 
Sxon 7 
Sxon 8a 
Sxon 8b 
Sxon 8c 
Sxon 8d* 
Bxon 8e 
Sxon 8f 
2812del<TC) 
16e5dol (N) „ 
2341C to T 
22540 to T 
21140 to A 



5 • TGTAAAACGACGGCCAC3TCTCGTGCAGAAGGCCGCGCT3 ' 
5 • CAGOAAACAGCTATGACCGCCCACAGCCACCGCGCCACS ' 

5 ' TaTAAAACGACGGCCAGTCTGTCTCCAGCAGACACTAA3 • 
5 * CAGGAAACA0CTATQACCCACAATaCTGAACTGCAGAG3 ' 

5 ' TGTAAAACQACGQCCAQTCTQAAQACCCTCATGCCTTQ3 ' 
5 • CAGaAAACAGCrATGACCACACTTCTCTGTGGGCraTG3 * 

5 ' TGTAAAACGACGGCCAaTTCQGAaAATCTGGAGGCTGA3 ' 
5 • CAGGAAACAGCTATGACCCATTACAAGCTGCTCAACCC3 ' 

5 • TOTAAAACGACGGCCAGTCGAAAGCCTTCCAGaCAGAGB ' 
S ' CAGGAAACAGCTATQACCCrATG0GAAGGTCCTGGCTC3 ' 

5 ' TGTAAAACGACGGCCAGTCTAGGAACAGTGCGCCAGTT3 ' 
5 * CAGGAAACAGCTATGACCATaGAaTCGCACAGGAAGQA3 ' 

5 'TGTAAAACGACGaCCAGTGCCCATGCrGTTTTCTCTCAS * 
5 • CAOGAAACAGCTATGACCCCGAGGACACATCCTTATGAS ' 

5 ' TGTAAAACGACGGCCAGTCCTCGTTCCCACGTACCATC3 ' 
5 • CAOGAAACAaCTATGACCGTAGCAGTAGGTGCCCTTGA3 ' 

5 » TQTAAAACGACGGCCAGTCCTGGTCGTCCTCAATGTCA3 ' 
5 • CAGGAAACAGCTATGACCCATAGAACCAGCAGAACAGCS * 

5 •TGTAAAAaGACGGCCAGTTGGTTCACaTCTCTGGAGCT3 ' 
5 • CAQGAAACAGCTATGACCGAGTTGTAGACCTTCATGCC3 ' 

5 ' TaTAAAA0GACGGCCAGTGGGCATGAAaGTCTACAACT3 ' 
S • CAGGAAACAGCTATGACCGAACTTCTTGATGTGGCAGGB • 

5 'TGTAAAACQACGGCCAGTCrGGATGCGCTGCCTCTACaB • 
5 ' CAGGAAACAGCTATGACCTCAGGCCGCCGACAGQAATG3 * 

5 ' TGTAAAACGACGGCCAaTTCGCCTTCGACTTCTTTTTCS ' 
5 * CAGGAAACAGCTATGACCCCAAACAAATAAGAAATGCT3 ' 

5* GCC CAG CTC TCG CCC ACC AG 3' 
5 ' TCA GGC CGC CGA CAG GAA TG 3 • 

5* CCT GOT CGT OCT CAA TGT CA 3' 
5' GGT AGG GCA CAA GGT AGC AG 3' 

EcoMI-F;5' GGGCATGAAGGTCTACAACTCCA 3' 

ECOMI -R ; 5 • CCGTAGAGGCAGCGCATCCAGTCGCCGAc£tAGAAC3 ' *♦ 

5 • -GAGGGCATGAAGGTCTACAA-3 ' 
5 ' - CCCACGGTAATCTCAAACTT- 3 ' 

S ' -TAOTGTGCCCCTGCTGTTGC-B ' 
5 ' -CCCACGGTAATCTCAAACTT -3 ' 



247 bp 
276 bp 
276 bp 
2S3 bp 
353 bp 
26 B bp 
371 bp 
766 bp 
503 bp 
240 bp 
362 bp 
523 bp 
499 bp 
120 bp 
119 bp 
244 bp 
377 bp 
528 bp 



• Due to the inability to directly sequence these PCR products, the 
fragments were subcloned into pGEM-T Easy Vector (Promega) as described 
by the manufacturer and several colonies sequenced for each individual. 

** Because originally there was no appropriate restriction enzyme site to 
distinguish mutated alleles, the EcoNI-R primer was modified (3 -base 
change, tga to cct, underlined in primer sequence) and a new EcoNi site 
was introduced to the mutated allele (CC2341 (C/T)NNNNNAGG) . 
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DiscassxoN 

Consanguineous" families from isolated regions 
of Japan provided the genetic material that led to the 
discovery of mutations in WFSl in WFS patients of diverse 
genetic backgrounds. We believe that mutant alleles at 
WFSl are responsible for the disease for several reasons, 
beyond the fact that the gene maps to the critical 
region. In each of the six pedigrees, mutant alleles of 
WFSl co-segregated with the disease phenotype . WFSl was 
shown to be expressed in brain, pancreatic islets, and in 
a p-cell insulinoma cell line, consistent with the 
disease phenotype. Seven different mutations were found, 
as well as a presumed microscopic deletion. The three 
missense mutations were evolutionarily conserved between 
the mouse and human (Figure 5) , further suggesting their 
biological significance. None of the mutations were 
observed in normal chromosomes* 

The Australian Caucasian family WS-5 was 
particularly interesting, as each affected child appeared 
to be homozygous for a P504Ij mutsmt allele inherited from 
the heterozygous father (Fig. 6C) . Repeat sampling cind 
analysis confirmed these results. Analysis of the 
mother's DNA with new markers between D4S500 and D4S431 
suggested that the deletion was confined to a region of 
<170 kb. Recently a patient with another autosomal 
recessive disorder was observed to be heterozygous for a 
missense mutation in combination with a partial deletion 
of a gene (Ries et al . , Human Mutation 12.: 44-51, 1998). 

The expression pattern of WFSl appeared 
ubiquitous by Northern analysis of polyA+ RNA (Fig. 3A) . 
Yet interestingly, the most prominent mRNA observed in 
total RNA was that in pancreatic islets (Fig, 3C) . This 
high level of expression of WFSl in islets might explain 
why the earliest manifestation of WFS is insulin- 
deficient diabetes mellitus (Barrett and Bundey, J Med 
Genet 34: 838-841, 1997) . Further analysis of the cell 
biology of WFSl will be accomplished through generation 
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of specific antibodies, monitoring expression in cultured 
cells, and gene targeting to define genotype/phenotype 
relationships . 

Swift et al. hypothesized that heterozygous 
carriers of the gene for WFS were 26 -fold more likely to 
require psychiatric hospitalization than non-carriers 
(Swift et al.. Molecular Psychiatry 1: 86-91, 1998). 
Blackwood et al (Blackwood et al . , Nature Genetics 
12.:427-430, 1996) reported highest lod scores with 
markers mapping to the region D4S431-D4S4 03 in a genome 
scan of a large family with bipolar affective disorder. 
These findings suggest that mutations in WFSl might be 
implicated in patients with psychiatric diseases. 

Mutations in WFSl appear to result in premature 
death of pancreatic islet 3 -cells leading to juvenile 
onset insulin- requiring diabetes mellitus (Karasik et 
al., Diabetes Care 12: 135-138, 1989), The (3-cell damage 
in autoimmune Type I diabetes likely results from the 
interaction of the HLA locus as a major susceptibility 
gene, along with multiple minor gene defects. In 
contrast, islet 3 -cell loss in WFS is monogenic in 
origin. Itt^ortantly , the WFSl gene appears to play a 
major role in maintaining normal islet 3-cell function, 
as mutations in this gene alone can result in loss of 
islet 3-cells, Genome scans for both Type I and Type II 
diabetes mellitus have not implicated major genes in the 
4p region. Yet since these are complex diseases, 
mutations in WFSl might play a minor role in these more 
common forms of diabetes. In addition, WFSl may 
represent a new therapeutic target for treatment and 
prevention of diabetes mellitus and for neurodegenerative 
disorders. 

The present invention is not limited to the 
embodiments described and exemplified above, but is 
capable of variation and modification without departure 
from the scope of the appended claims. 
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What is claimed: 

1. A recombinant DNA molecule comprising a 
vector into which is inserted a heterologous DNA segment 
from human chromosome 4p, the segment being located 
between markers D4S500 and D4S431, the segment comprising 
a gene, mutations of which are associated with Wolfram 
Syndrome . 

2. The recombinant DNA molecule of claim 1, 
wherein the gene is composed of exons that form an open 
reading frame having a sequence that encodes a 
polypeptide about 880 to 90 0 amino acids in length. 

3. The recombinant DNA molecule of claim 2, 
wherein the open reading frame encodes an amino acid 
having greater than 60% identity with SEQ ID NO: 3 or SEQ 
ID NO: 5. 

4 . The recombinant DNA molecule of claim 4 , 
wherein said open reading frame comprises a sequence 
having greater than 60% homology with SEQ ID NO: 2 or SEQ 
ID NO: 4. 

5. The recombinant DNA molecule of claim 1, 
wherein the gene is composed of exons having sequences 
greater than 60% homologous with the sequences of the 
corresponding exons in SEQ ID NO:l. 

6. An oligonucleotide between about 10 and 100 
nucleotides in length, which specifically hybridizes with 
a portion of the recombinant DNA molecule of claim 1 . 

7. An isolated nucleic acid molecule having a 
sequence that is part or all of a sequence selected from 
the group consisting of: 

a) SEQ ID NO: 1; 
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b) a variant of SEQ ID NO:l that is 
substantially the same as SEQ ID N0:1 within the exons of 
SEQ ID NO:l; 

c) a sequence having at least homology 60% 
to SEQ ID N0:1 within the exons of SEQ ID N0:1; 

d) SEQ ID NO:2; 

e) a variant of SEQ ID NO: 2 that is 
substantially the same as SEQ ID NO:2; 

f) a sequence having at least homology 60% 

to SEQ ID NO:2; 

g) a sequence encoding a polypeptide 
substantially the same as SEQ ID NO:3; 

h) a sequence encoding a polypeptide at 
least 60% homologous to SEQ ID N0:3; 

i) a sequence encoding a polypeptide 
substantially the same as SEQ ID NO:3, that additionally 
comprises one or more of the sequence variants set forth 
in Table 1 . 

j) SEQ ID NO:4; 

k) a variant of SEQ ID NO: 4 that is 
substantially the same as SEQ ID N0:4; 

1) a sequence having at least homology 60% 

to SEQ ID NO:4; 

m) a sequence encoding a polypeptide 
substantially the same as SEQ ID NO: 5; and 

n) a sequence encoding a polypeptide at 
least 60% homologous to SEQ ID NO: 5. 

8. An oligonucleotide between 10 and 100 bases 
in length, that specifically hybridizes with a portion of 
the nucleic acid molecule of claim 7. 

9. A polypeptide, which is produced by the 
expression of the nucleic acid molecule of claim 7. 



10. Antibodies immunologically specific for 
the polypeptide of claim 9 . 
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11. A polypeptide produced by expression of an 
isolated nucleic acid molecule comprising part or all of 
an open reading frame of a gene located on human 
chromosome 4p between markers D4S500 and D4S431, 
mutations of which are associated with Wolfram Syndrome. 

12. The polypeptide of claim 11, which 
comprises a hydrophilic N-terminal region of about 300 
amino acid residues, a hydrophilic C-terminal region of 
about 240 residues, and a central hydrophobic core of 
about 350 residues. 

13. The polypeptide of claim 12, having an 
amino acid sequence substantially the same as part or all 
of SEQ ID NO: 3 or SEQ ID NO: 5. 

14 . Antibodies immunologically specific for 
part or all of the polypeptide of claim 11, 

15 . A method for determining the 
predisposition of an individual to develop Wolfram 
Syndrome, which comprises examining a WFSl gene sequence 
of the individual for mutations resulting in expression 
of no gene product or a non- functional gene product, the 
mutations being indicative of the predisposition of the 
individual to develop Wolfram syndrome. 

16. The method of claim 15, wherein the 
mutations comprises sequences selected from the group 
consisting of 2812del (TC) , 1685del (CCTGCTCTATGTCTA) , 
2341C to T, 2254 G to T, 2114 G to A, 1681 C to T, and 
leiOins (CTGAAGG) . 

17. The method of claim 16, wherein the 
mutations are. detected by PCR amplification using primers 
selected from the group consisting of SEQ ID NOS: 32, 33, 
34, 35, 36, 37, 38, 39, 40 and 41. 
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2 1 
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1 MHSGTPPPSPSOPPPPPAPQPQAKARLNATASLEQDKIEPPRAPRPQADPSAGRSAGEAA Mouse 

1 MDSNTAPLGPSCPQPPPAPQPOARSRLNATASLEQERBERPRAPGPQAGPGPGVRHAAAP Human 

61 APEPRAPQTOSREETDRAGPMKMSVEIPFEEVLEKAKAGDPKAQTEVGK^^ Mouse 

61 A-EPQAQHTRSRERAIXSPGPTKGnMEIPFEEVLERAKAGDPKACTE\ra Human 

121 EUJSCSAVAWlJU^QGRREAVKLiRRCIADRKGITSENEAEVKQLSSETDLERAV^^ Mouse 

120 EUJSCTAVDWLVIJUVKQGRREAVKLIJU^CLADRRGITSE^ Human 

181 ALVMYV«UIPKKKKQVAV8ELLENVGQVIffiQIXX3W0PGPVPKSI/)K^ Mouse 

180 ALVMYWKU^KKKKaVAVAELLENVGCVNEKDGGACPGPTO Human 

241 nyIALDDFVELTKKYAKGIIPTNLFLODEDEDEDELAGKSPEDLPIJ^QKNA;^^ Mouse 

240 OTIAIJ)DFVEITKKYAKGVIPSSLFIiOD-DEDDDEtAGKSPEDLPLRUCV^ Human 

301 iKEYLIDVASKAGMHWI^TrV^HHINALIFFFIISNLTIDFFAFFIPLVVFYLSFVSMV Mouse 

299 IKEYLIEMASIUGMHWLSTIlPTHHINALIFFFr^SNLTIDFFAFFIPLViryi^FlSMV Human 

3 61 ICTUCVFQDSKAWENFRTLTDLIJURFEPNLDVEGAEN^ Mouse 

359 TPTT n-rnn-rr^T-rTTTFTTT TTT ^ ^ ^^t,i^ rv.7vnj^j^^r^ffiJi^r ^.VYAllPl^^ Human 

421 FPIJ^KIX:iPCSEIAVISTFFTVTSrXKSLSSSAEPYTOIUa.VTEVAAGI^SI^^ Mouse 

419 FPIASKDCIPCSEIAVITGFFTVTSYLSLSTIMPYTm?aATEVTAGliSI^^ Human 

481 RFLKVI/3QTFFTVPVGHPlILNVSI.Pa-LYVYLFyLFFRMAC^ Mouse 

479 PYUCVLGQTFITVPVGHLVVLNVWPCLL^VSiLYLFFRMAC^^ Human 

nns483fsrter544 L504 A508 

541 MWCELSWILLQSTGLGLVRASIGYFLFLFALPILVAGIJaSCTVQF Mouse 



539 



MWCELSWIIiESTGLGLIJlASIGYFLFlJALPILVAGI^VGVl^FARWFTSLELTO 



Human 



601 VTTVICGVPLIJTlWWTKAKFSVMaiVKSLTKSSMVKLILVV^^ Mouse 

599 vWAVCSVPIJ^IJIWWTKASFSVVCaiVKSLTRSSMVKLILVW^ Human 

X648 

661 VYNSTLTOQQYGFLOSPRAWKETNMARTQILCSHLEXSHRVTWTGRFKYW^ Mouse 

659 v™STLTVJQQYGALCX3PRAWKETNMARTQILCSHIX3QHRVTWTGRFK^ Human 

V695 

721 AI^IMLPFFI^DWMRCLYaEAyPflCSSaOTSTAEEELCRIJCQIJaCHPCHIKKFDRYK^ Mouse 

719 AirmLPFFIGDWMRCLYQEAYPACSPOm'STAEEELCRLKLIJa^ Human 
L724 

781 VGMPF--GTNGNRGHEEDDITKDrVIJlASSEFKDVLLNLRQ^^ Mouse 

779 vcaiPFSSGADGSRSREEDCVTKDXVIJUVSSEFKSVLLSLRQGSLIEFSTILEGRIi^ Human 

839 VFEIJCAISCLNCOTQLSPAKRHVKIEQEWRSTVHGALKFAFDFFFFPFLSAA Mouse 
839 VFEIJCAISClJ^CmQLSPTRIUIVKIEHDWRSTVHGAVKFAFDFFFFP^ Human 

A882fsAer937 
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Figure 6a WS-2 
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SEQUENCE LISTING 

<110> Permutt, M. Alan 
Inoue, Klroshi 
Muekler, Mike 

<120> Gene Mutated in Wolfram Syndrome 



<130> WaehU CI-0277 

<150> US 60/102,031 
<151> 1998-09-28 

<160> 43 

<170> FastSEQ for Windows Version 3.0 



<210> 1 

<211> 67212 

<212> DNA 

<213> Homo sapiens 

<400> 1 

aagcttgcta cactgcattt gactctgggc actgatagca gtacttggct gcagacactc 60 

gttccaaaca ggccccattt ttccatctct gctgctgtta ttaggggagc ccttagactc 120 

tcttgcagcg ccggaatagg cggtcaagac gtgtgttaat atcgcaacag caaatataat 180 

gaatctgcag ttggggacgc tgaggccgga gtggtggatg aaaggtggcc ggagcctttt 240 

ccacgggtcc aaaccacctg ttacaggaga aggcgagcgg cctcgctaag caactggacg 300 

ttccgcgggc ggggcggggc cggggcccga gtccgctcgg aaactttcgc tgtgggcgag 360 

ccggacccgc cttctggccc ctcgggccca accacgcagg ggtagtgccg ggccgcagct 420 

aggcgaggcg caccgtgatt ggcggagatg tggagtgact ggcggctaca ccggccactc 480 

agcaggccga gctggcgccg catccggggg gccgcgtctg gagtggaggg aggccgaagg 540 

ccccgcccct gccccgcccc ctcgtgcaga aggccgcgct agccggctct tcagcagcga 600 

gtgcagattg ctcccccgcg gccgcagatc tcccgtttgc gccgcgttca gctgctcccg 660 

aacaactttt ctgccggccc agaggcccca gggcgtcgca gcgccgcgtg cggcccactc 720 

acgggccggt gagtacttcg gcgctggggc agtggcgcgg tggccgtggg cagcgcgcag 780 

gaggcgggca agagccctga ggcactgccc tcctcgggcc tcagtttccc cttccgagct B40 

gatgggtggc tggccccaaa gtcccgacaa ggtccccgaa gttggagggc cgggggtccc 900 

gcccgctctg caacgcgcaa ggcgacccct gttcccgggc ccgaacgggt cacccggggg 960 

gcgcgccccg gtcccccgcg cgctgtctgc atggagcccc gccgcgggcg ggacagcagg 1020 

cccgagaggc ggagtggccg gcgggcaggc cccctccccg cgccccgcgc catccccccc 1080 

gagttgcccg gtcatgcccc tcccccgcgg tcccgggacc cacggccgcc ttaggtgccc 1140 

gccaggccgt tggaaggcct gggcccccgg gaggtgttgt ggggggaagg ggggttcagt 1200 

cccccagggg gacccggccc gcccgagggg cagcttaggg gtcactctga atccccacag 1260 

cccacttagg agcgctgtgg cagttcctct ccctctcggg gcccctcagt agagagttga 1320 

ggggtttcta gtggcagcac ctggaccttg ccccgtatga actccacctt ctggatcatt 1380 

agctttttgt ttgatcaacg aaaactttct ggagaatcct acccagccct tttggatgtg 1440 

atggcgccct ggcctctcac ccacttagca gatgatgtta tgaattccaa ggaatcgcat 1500 

ttgtgcaacc cccaccaccc agatttaacc ctctcagtag ccttaggcgg agaagttata 1560 

gtccccgttt gacaccccag gaagctgagg ctcagcaggg tttggtgcca gcccaggacc 1620 

acgaagctga tggggcagag ctggtgttag gcctctgccc tatctctcca gaagcagccc 1680 

gtcccccctc accttggaca atgacccctg gcatttggcc caggctttgc agtttgtaaa 1740 

ggctttctct tatgccagca tccatcactc caggagctgt gccctgggta ccccatgtgc 1800 

tggctgcctg ttctctgcag caggagaggg agcccacggg aagcccagag gccacctttg 1860 

tcactgtccc cacctgttta cttctgtgtc caagggtctg tgattcccat ggggtctccg 1920 

ttgtatccaa ccagggattt ggagggaggc ttctccacat ttcagcctct tttccctacc 19B0 
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tgctggtttg tcctcccttc gcagacacag gaaaagttgg ctgggagacc ccatctcacc 204 0 

ttcttcccca ggatggaaga gcatggaggg cgtctctgac ttaggccCcc cctgcccatc 2X00 

ccggaccaat gctggacagg cctctggggc tggtcacaga ggggacctca gctcagtctc 2160 

aacccccaga gatgtctgca ctgccagaca gacccgaaac acccatcctt tgacggtggc 2220 

gataagaatc ataataggtc gggagcttta tgctacacgg gtcccctccc ctcccagtitc 2280 

cccacttccg gaagtggtac caccacagcc agccagcccc agccacaaac cagaccaccc 2340 

cctigtgccct tcattcctca ctcccagttg ggtctctcct ggcticctgag cctcactcgg 2400 

atgcctgccc tggttcagga ccctcgcctc tccctggacg acccccaggc tcctccctgg 2460 

tcccgatccg ggcctggctg cacgcagccc tgctggttct tcagctgcca cttccctctg 2520 

ggctatcggg tcaggtccgc cccccgagcc aggagccggc attctttcgg gccctcaggg 2580 

acccgcctcg ctgctcactc agcctcrccc tcctgtgtcc tctctccagc agtcacctcc 2640 

cagccacctg cgtgctcctg gccacaagcc tggctcactg tccctctgct gcctcctctt 2700 

ctgggcacaa cccctgctgg aacgcccagc cacctcccag cttgcctggc tcccgcctgc 2760 

ttgtccagaa ggctcagtca aggcctcctg caggatttct ggcctgcctc ttcccacagt 2820 

ggagtgaggg cccccttctg tgccgtcttt gtgggtgtct gtttaggtgt taacccttcc 2880 

catctagatg aagctgctcc aatgctggcc ctggacaaga cttctctagg gcaggggggc 2940 

ccagcacagt gcttggcaca agatggacct accctcagtc cacacctgct gcccatccag 3000 

gccgggcctg gagcaggtgc caggtgtgtg gtgggggctt ggccactatg gccgtcatga 3060 

atgctgcttg gggcagtccc aatrtcaaacc cctctgttcc tgtgaatgtg ccacattgga 3120 

tagagccgga atgttccttg gtttgtccct taatactctg agcaaaaata ggtacctggg 3180 

atccatcaag ggctttgtgc ccttgtcttg aggagcagcc ctgtgccagg gggctgggtc 3240 

aggccaggcc agagcagaca tcccagggcc aggacctcct gggcctgctic cctcagagct 3300 

tgctcacaga tgggttttaa aataacattt gccttaattt caaaaacatc acacattcat 3360 

tatagaaaac tagggaaata tgtttaaaag aagaatiaaaa tgaaaccgcg caHaagcttt 3420 

atccccagag acaacaagta tcttttttag ggggcgtttt cttccgtttt ttggcagata 3480 

aattttgttt tcatcctaaa cagactccgg ctaatatctt ctgctgcttt aaatcctgcc 3540 

ttaacagtgt taggcccacc ttttccttgt ctcaaaacgt ccatgcacga ggticacgttt 3600 

accagctgta cagcgttgca ccatatggag gggtgtaggc tttatagcca gcccctttct 3660 

ggacagtgct gaaagtgcca tatgatggac attgtgtgtg agccccgagg gcatctccag 3720 

tgactcccct ggactggcac cctggaggca gaggcactga tcagagagag tgaacatcat 3780 

cttggcttcc cacaggggct gtttctgagg cactggcttg ggagatgggg ttgccccccc 3 840 

agagagaaca gaatgggggt ggcctgggtg agggccagag cccttagggc agaaggccac 3 900 

gtgtggtcat tctgctgcgg gtctgagtgg atgcatggtg cctgcagccc tctccaggtg 3 960 

gcggcctctc agctgaactc cttggtgccg gtgtttgtgt ttggacggga taaaaatatg 4 020 

cctcattgct catgaggaaa gaatgctagc aactttggtt ttgtggatct tacacagtga 4 080 

cggcttctaa gccattgctg tggtcaccag tagggagctg gggctgagtg ccccaccgtg 4140 

caggtgggga tctgagggtg cagagtggat gcagctggag atgcctggat gctggggtct 4200 

gggggaggga cctgctaagg tcccacagca ttattgcctg gatggcgctc ctgccctagg 4260 

ccaggctgcc tcaggttctg cccacatigca ttaagtgctt aacgtattcc tggcaaggca 4320 

gccggtcttc cgagtggata agctgcagcc ctgccctcag ggctcacagt ggctgtaaca 4380 

tgcaggaaga ctcagagagg gaaggaccag acagcaggtc cttccaccgg gagatctttt 4440 

ctgatcttca ttgtccctgt gaccaaccat gaccaggaat tgttgtgctt gttttgaaat 4500 

aacaagccct ttgatccaaa ccagacttgg ccaaccccca gggaacctga aaattaaccc 4 560 

tcctgtattc atccattctt actaaccaga gattttactt taaacagttt ttttttaact 4620 

gcagtgtgac atacatggta cataggccag ctagaatctg ttttcctgaa attcctttta 46B0 

tgaagtgttt tccaactgaa catgcctttg tttagcacct gggagtaatg gcttagggtt 4740 

tttcttttcc tttttctttt tttttttttt gagacggagt ctcgctttgt cacccaggct 4800 

ggagtgcagt ggcgcgatct cggctcactg caagctccat ctcctgggtt cacaccattc 4 860 

tcctgcctca gcctccccag tagttgggac tatcggcgcc cgccaccacg cctggctaat 4920 

ttttttgtat ttttaataga gacggggttt cactgtgtta gccaggatgg tcttgatctc 4980 

ctgaccttgt gatctgcccg cctcggcctc ccaaagtgct gggattacag gcgtgagcca 5040 

ccgcgcctgg ccggcttagt ttttttttaa tgcccaatag tttgctttct cttctgccac 5100 

acacgtaatc cccatagcac ccctatgtgg gaggtgctat ccccatttta catatgggga 5160 

aacagacaca gcgaggtcat: gtggccacta gtaagggtaa ccgacgcccc acaccatg^t 5220 

gcttccagct gtaacctcta gcagagggag ggctctgatg gagaggtgat cgcctgagtg 5280 

ctgtgtgttt acctgagtag gtgaggaaga cagggactcg ggacagacat: agcttggaag 5340 

ctcccacaaa ggaggtatcg ctggagctct tgacttgaag ggtgcttagg gtctaggctg 5400 

ggggaggggg gtaagaaggc attccgggaa gagggccctg cagaaacaga agtgtggagg 5460 

tcagcatttg tatgctgatg ticgaagggta gccagtgttt tgcagcgaat gggccccggg 5520 
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agagagggtg cagtggcagc ccgatttcaa agtcatcttt ttcagaaatg agcttgcaga 5580 

agcattiagaa ttgggcaggt ccggatttga atttcaggtc tggtacctgg agccgtttga 5640 

ccttgagcag gtccctcccc tccctgcacc tgggttcctg tctgtgaagt ggggtgaggt 5700 

tgaaggataa aacacttccc catctataac tctgggatga caataggatc taccttccag 5760 

gtccatgcga ggaactcacg acaataggac ctaccttcca gggccatacg agaaactcat 5820 

gagaaaagac aagtacagag aacagagccg ttggtcttga tggcagagat gccctggcgt 5880 

aggtcagccc tcagcaaagg tgccctctct ctctgtctga gaaagtgaca cctgacttcg 5940 

gggcagtggg aaccatggga gagaaggtag acaaggccac gtgtcatttg aagacctgac 6000 

ttttcaccat ctcaggtggt ccggtggcct gcaggaccgt ggttctgttt gcccacagcg 6060 

tgtgggttct acgtgcgatc ctgtatttct ctgccacagc tgactaaggg gacacttagg 6120 

tctgcacgtg cagttaaatg gctgcttttg tggaaggtgc cagttcccgg gagctgggaa 6180 

cttggaaaca ggatggagca ggctcccaag cttgaccatg catggtttgc aectggggga 6240 

tgcacccggg ggtgcttgac catgcatggt ttgcacctgg gggatgtttt acactaaggg 6300 

ctcagtccct ctgagggagg aacgtgtgga ttgaaatctg cagagttctt gagcgacacc 6360 

agtctgatct ttaggactca tttgcagcgg gaatggagca gaaattgttg tgtgtggagc 6420 

ctggggccac tcctgactct tccagtcaca gctgccccca aaagtgacct ctgggcacct 6480 

actcagacct ggctgctcag tgccaagcac tgggtgttca gaggtggcca ggggactccc 6540 

tcgtgccgcc catggtgtct ggtctcactg gtcctcacag ttgccccgtc cccattttca 6600 

gatgaggagg ctgaggtgca gagcgggatg tgggctgctc aggaccacac agctggtggg 6660 

cacagagcca ggagtacagc gctgcggcag cttggggctg ggagcgagac tctctgttac 6720 

cagcctctcc ctgccctgcg cctcagagga tggctgtggc cctgagcttg ggagggtcat 6780 

ttgggaatct gaggaaaagt t^ccacagcc tcacacagca gcaggttcaa ggtccagcag 6 840 

tgccaggggt ttgcatccca cccctaacgc cgctcatcca gggcccatcc gggtccgggg 6900 

ctcgggatcc tgcagcgctt tcagctctgt gtctgtggcc ggtggttgtc cttcattaca 6960 

gggatggtgg tgttcttcca tgttgctaag cagcatgtga gggcctctgt tgaataaaat 7020 

aaagcccttg ctccggcgtg acacgggtca cgggatctct gctcactcta gtctcttctg 7 080 

atgcccccac cgcactgttt gccttagttt gctttttact ctctgtgttg atttctgggg 7140 

catcagtcct ctgacttcgg tgttgtggct tgggcctggg gtgggggctg tggaacggca 7200 

gatctgggta tgaatcccag ctccacccat ttggctccct ctcttctctc ccccgccctc 7260 

ttctcccctc cccagctccc tccccctctc atatttactt aactttactc agtaccagcg 7320 

tattagtttt ctagggcttc cgtaagaaac tactggaaac ttggtggctt aaaacaacag 7380 

aggtttgttc tctgatggtt ctggaggcag aagcctgaag tcaaggtgtg ggcagggtgg 7440 

gtccctcctg gaggctctga gggaggctca gttcctcgcc tctgtccagc ttctggcagt 7500 

ggccagtggt cctcctgctc cttggccact gtcacccgtc tccgcatggc tctgttacac 7560 

ttctcttcct ataggccatc agctcttgga gccagggctt ccctactcca ggctgatctc 7620 

atttcaagat ccttaaatac atcttcaaag acctgttttt ccaaataagg gcacattcac 7680 

agacattggt gggcacatga tgtggggccc aattcagccc cctgcagcca ggccccacat 7740 

gggtgggctc tgggtgttgc tatcatcatc cccgcttctt ggatgaggga aggccagggt 7800 

tgtgtggctc atgcccagtg aggcagggac tccaggcctc ggcatgtgca ggtctgagat 7860 

tgtaagtgcc atgccatctg tagagtcacg tgggtgagtg tcctcccatg gtttcctccc 7920 

tggaagcggt gctggcccat ggggactgta ctgagtgtca gcgagaccct gtatggagtg 7980 

tctggcagct cccacccgcc tccctctgct tttctgtctc cagcagacac taagtgccag 8040 

agcgggctct gccggtgctg gatgtgcctg accttgactt ttcttccagg caggarggac 8100 

tccaacactg ctccgctggg cccctcctgc ccacagcccc cgccagcacc gcagccccag 8160 

gcgcgttccc gactcaatgc cacagcctcg ttggagcagg agaggagcga aaggccccga 8220 

gcacccggac cccaggctgg ccctggccct ggtgttagag acgeagcggc ccccgctgaa 8280 

ccccaggccc agcataccag gagccgggaa agagcagacg gcaccggtaa gggagcaggc 8340 

tgggaagccc aggctgggga tgttcaggga tagctgggtg ggaacggggt tcagccaccc 8400 

ctggagggtc cccccgccag gtcctctgca gctcagcatt gtgcagctcc catgctgtgc 8460 

acaggcgtcc acccagtggg gctacccacc tcctcagagc cttgcacctg tcacctttgt 8520 

ggcacccact cgaggtggtg ctggtgcccc cactcctctg cagtccttct cttcccaggg 8580 

cctctgcagc acttcacagc ttccatttgc aacagcgtcc aaacatgtgg agtaaattca 8640 

cgggccctct gtaaccaact agggtggcag gccagaggtg acagccacac tctcgggagt 8700 

gaggccacat gcgggcgaag cccttggtgc tgagtttccc ctctccctga gggagacttg 8760 

tgagggacgg gcgtccttca cctccatcct gaggcgggat gctggggccc tgggtgttca 8820 

caggcagagc tgactgaggc ccttggtctt cacaggccga gctgactgag gccctgggtg 8880 

ttcaaagcag agctgaggcc cttggtgttc acaggcagag ctgacaggct tgttggttcc 8940 

tgggtggtga tgtgagggtg gtgggtgtga gccaggggag tccggcccct gggcaccttc 9000 

tccagetgac tcaggcctcg gtcagccttc acttttgcta gagaggccgc tgccagggag 9060 
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gctcagtccc agcccatcac tgcccctccc gcccacgcag aagccccaga cccatggtga 9120 

gtgcctgctg tgtacaccag ccttttccat gtgcctgcag gactcctcat ggcagtcctg 9180 

agaggggaaa ctgagacacc gagcctaggc gactcgccca agagacccag cgggcgagag 9240 

acgaggtgct cgagtggtcc aggtttcctt cttgccccag tcaccccact ggcagggccg 9300 

tggggcctcc ctgctgtggt cgcagcgcgc tgaggctccc atgtaccact tatgctgggc 9360 

atgcccttgg aaactcactg cagccctggg ccctgtttgt cttcgcatct cctcaaaaga 9420 

ccactgtgct cctctgcttt gttttggagg cctctgggcg gatgagggcc tgaccccggg 9480 

atttgggccc agcctcacgg cttcagggga acagatacca gggctcggaa gcttggtttc 9540 

aaticcgggca acgagaacag cqcgcaaagg acagagtrcca agaccgtgtg tctcagataa 9600 

gaagagtcag gaagcagctg tggtttgttt ggctccggtc tccaatgrgg gggcggtgaa 9660 

gagcaagccc tgggcgccgc ctgacagccc agtccccacc actgggcctg cagctggggg 9720 

cagtgggcat cccgtccccc agccactiggt cactgatgcg gccaaagcag gcccggccct 9780 

gcatgcccag tggccgcagg gagggcaggg cccagggtcc ctgccagtag cagctgctgc 9840 

gagaatctgt tatttagctc ccccatccag ccaacatagg tgggtgagca tcagttgaaa 9900 

taccaatagg taacaacaag ggtagctatt taactgacac ttccccgagc tiaggcatcct. 9960 

tttagggcat tagcccactt gcctctttaa cagccctggg aggcaggctc tgctgctgcc 10020 

ctgtttttca gatgggaaac cgagggatag agggcttaag tagatgaccc agcgtcccgt 10080 

ggctggtgaa gggtggaggg gagatttaaa cacaccgtgg agcctgtgct cactatgggc 10140 

ctcaggtcat, ctigcgcccag gga^ccccat gtcccaggct cagagaagtg cacagagcag 10200 

gttggcctgt ccctagatca gtgcctggcc aggccctggt gacaggagtg aaggctcagt 10260 

cccgccctcc gggagccctg agcaatatgg gagacaggag tgaacaagga actacattct 10320 

ggggctcatg ragcgcgggg gacaacccac attgcagtcc cctaggggcc ccccaggagg 10380 

gcctggccac tcttcagggc atcaggagga ctgcctggag gaggcatcgt cggagcccca 10440 

tctgagctgc tccttggcct gtacacttgg gaaacaggag aagagcagcc ttggggtitgc 10500 

ccttctctgg ttgcctccct gaggaactga cccccttttc agcccttcat tcagcaaggc 10560 

tggggagggg ctgcttcaga aacgtcactig ggctggagcc ccacacttcc ttcccatcct 10620 

ggctctgtgg ccttggcaag tcacctgcca cccggggaac cactgccgac gggtcggggc 10680 

ttggaaactc aggggctcct gggcccagac aggtaaagga aacagtgact ccggctagaa 10740 

aggagactgc agggattgtg aggctct.tgg gcgtcctgtg ccagggcggg actcctcatg 10800 

gcagccctga gaggggaaac tgtgtcctgt gtggagcgtg gatgcccacg tgctgagagg 10860 

tgttgtctgt. cctgattttt gacagcagcc caagt:gatgt gccagtgggc cagggccggc 10920 

acctgcctgt cggtggctgc cccagggctg atgggcaggg gccgtttctt cacaggcagc 10980 

tgtggtgagt cctgtgatca cctcagtgtt gtctccagcc ctggggggat ctgttgttct 11040 

gctcaacaag agaataaagc tigggctcgrc acacctggtic ccaggtgacc cagcaggcac 11100 

agagaggtag agttgacacc aagcctcact cagccaccgt gacacggccg tgcgagtcca 11160 

cgccctggca gagagcctgt tctgggcctg ttctgtttag ccctggggct aaagggcggg 11220 

ctcagacgca cagagggaca ggcccggcgc aggtgggagg aaggaaagca gactgtgtca 11280 

tcgtctcagc ttcaggctgg gcgtgctgtg gctatcatgt gcttggcctg tgctaagtgt 11340 

gtgctaagca tgtggccctg gtgggcactc aggcctcctg ggcaccagcc gaggccccac 11400 

ccctitcccac tatgcggccc tgagctcccc tgctgtctct gtcattcacg ggtictaccag 11460 

caggacaagg cctggggttg gtggaggagg gaaggaggtg agatttggga gctttccgag 11520 

tcgtacccgc cactccgtct ccgagcagca gcactctcat ctctacaagt ctctacagca 11580 

gtgctgccac cgccttgacc agggaatcac cctactgtga ggtggcccag ctcactgtga 11640 

gactctgggg cagtcatatg ccctttctgt tgaccccagt gggtggccag cccacacgac 11700 

tgctgaatct gccaggcctc ccagccattg ggggtagggg gattgctttg gcagccacag 11760 

ggacacagat gcggacagga gcaggagccc tggggaggag gtgcacccca ccaggaagca 11 820 

cccaccaacg ct-gactgagg ctgcactgcc ag^tagagcc gctgggcgag ggacaaagaa 11B80 

gggccttcgg gtgtggggaa agcaggcacc ccagctccta cactaagggc accctgggca 1194 0 

gagagggagc ccagtggggg gcacccaaga cccacgagga attggctgtg gtgtcaagag 12000 

gtcagccagg ccaagcaggg ccctgagggt cagggaaagg agagggcatt gtcgcgagcg 12060 

gggcccagct gttagatgga ttgtttacat taaaggccag ccaggaggcg gagcagagag 12120 

aggagtagcc aggccacaag aaggctgagg gacaggtaga catggagggg agaaagccag 12180 

ggtggcctcc aggtttctgg ctggggctgc caggtggacg gtggtggcac agcctgaggt 12240 

agggccctgg gggtggttcc cggcggggtc cgagcttagt cttagagggt acaagggggc 12300 

ccaggaggga ctggggctct gatgcagatg tgggagtgcg ctgagcccct gctgccctgg 12360 

ggacaggcat agcaggaggt gctactcagg gcacccaccg ttgattattt aggaagcagc 12420 

tgtcacgcct gttctttatc ctgacctggc ccatctgatt gctgatgtgc tcctgctgag 124 80 

ccccccacac cacactgagg actggccaca ggatctggtg gcactcagtc ttctgggaac 12540 

ttggcagtgt gcctgtccca ctttgcgctc caagcctcgg acgtcacctc caggaaccct 12600 
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cccactgtgg 
tcgttcttgc 
gtgccctcgg 
ctg^ctgtcc 
ttcagcctgt 
ggtgcagggt 
cctgtgccct 
tacctttccc 
ggaggtcaag 
ggagacggga 
catttgacat 
atatgaaagt 
ttttgagcca 
ttcattgtgt 
aatcagggag 
tatgctagga 
ctggaatccc 
acgggtgaga 
gatggacgat 
atcctggagg 
ggcatccatg 
agggtgttct 
atcacactgc 
ccttttagtt 
ggcaagggat 
ctggatccag 
gctttcctgt 
agctggcatc 
cacaaactcc 
tggcaactct 
agtgccaccc 
gcagtcatca 
ggttcttggt 
cagtgagcat 
taactggtct 
gtgcgtacac 
actttgggag 
acatggcgaa 
ccgcctcftiat 
cagaggttgt 
actctgtictc 
cttgaaatgt 
ttttagacca 
acaatatgac 
gttgtggtgg 
gaaagccncc 
cataaaccat 
tgtggcctgc 
gtgtcatcgt 
gtgtctgcag 
ctggggggct 
agcaccaagg 
ctgatgccat 
ctgtgcaaga 
ggcgtccagg 
ggggacatcc 

ggaggtggcc 

ccagggatgg 
gttgtccatc 



gtgatgtcta 
ccggcccagg 
ctacgcaggg 
tgttcccgcc 
ccat^cacga 
gaccccagct 
gtictctggca 
tttggcacag 
ggcttgccca 
ggattacatc 
acatccaaaa 
ggaagccagc 
gctgtttggg 
gggaagaagc 
ttgagcttgg 
tttcagaagg 
cgtctgctcc 
cccacggaca 
gacccaggtt 
atgccctagg 
aatctctcaa 
ccatggaacc 
tattttgtga 
gtaagtgaca 
tgatgagtcc 
gggctcagat 
gctgcctgtg 
cacgcgccaa 
gctgggctgt 
gattggccag 
agaccatgag 
gaaaaagcaa 
ggtccacact: 
gctgggattt 
gcaggacgag 
ctcaaagaga 
gctgaggcag 
accctgtctc 
tcccagctac 
ggcgagccga 
aaaaaataat 
caggctgtga 
ccatat.agac 
ttgtgaactt 
gaagtacagc 
tctitgtgaac 
cctatcagat 
accaggttgc 
gatcgctcac 
ttcctctagg 
ggcagggtga 
tctgggggct 
ctctgagcag 
gttgcgtcca 
aagaggggca 
agggagaggg 
tgtcagggag 
gggcacccag 
cacagggctg 



cagtcaccta 
gcaggccgtg 
cctgttagaa 
tcagagttca 
gaatcattgt 
tccccatgtg 
cagggatgat 
ccccttgttg 
ggaccagtga 
atatttattt 
atttttaaac 
tgccctgc^c 
aagggaggcc 
cttccctcca 
cagttcagct 
agagagggca 
ct.gtgtactg 
tagatggcat 
ccagcaagat 
tagtgcc^ga 
atccataggg 
tcacagaggg 
ttgataggca 
aaatctcaat 
atagatctgt 
gatgtcatca 
tctcaggggc 
gatgagcaga 
gCccatccct 
gtgggggtgc 
gactgagagt 
ggaacggagg 
gacgcagccg 
gcctttgagc 
cccaccactg 
ttctgggctg 
cagatcacct 
tactaaaaat 
tcaggaggct 
gattttgtca 
aataaaaaag 
ataactt:ct:g 
ttggaccttc 
aggtcaag^a 
aacagccctg 
agcaactttg 
tgatgttaac 
acagcacctt 
gctgct.ct:gt 
gatagtgagt 
cggcagcaag 
ctggggaggg 
cgtccttgag 
gggagagttia 
ctcagggaga 
gcgtccagcg 
ggtggtgtcc 
ggaggggggc 
gagtgcgcgt 



cacagcccca 
gtgagctcag 
gggCgccctc 
gccagcatag 
cagtgtcaca 
ggggatgcgc 
gacagcacac 
tgggcgtcac 
gggagggagg 
accacatgcc 
actatggaag 
ccagcccact 
ccagccactg 
tccttttcca 
gctcagccag 
gctgcaggcc 
^ccagcgtcg 
ccacgaaggc 
atggcagtgg 
gaaagt:gagg 
cccgtgcatg 
aaggagacct 
gact:&tcaaa 
ataaattgac 
agagcctaga 
ggaagragtc 
tctgcccaca 
aggtgaaagc 
gaaccaggtg 
gggcc^tctg 
ggcggggttc 
tcgagcacac 
gcttctgggg 
ccacgaaggc 
ccatatggaa 
ggcgcagtgg 
gaggtcaggg 
acaaaaaaac 
gaggcagaag 
ctgcattcca 
agagagagat 
t:gaaaaat:ta 
catgttttga 

^ggagacttg 

gggtctccrg 
gaccactcac 
gatcagattg 
actgtggagc 
gggcccatgg 
ctgctgctga 
gaaggagaac 
tgggtgtcca 
gaccccggca 
catccaggga 
gggacattca 
agaggagccc 
agggaaagca 
ttccagggag 
gcctgaccac 



actctcatgg 
gggtcctgcc 
cctgccaggg 
aaagagaggg 
gaggccgatg 
cggtggtgca 
agcacatcct 
gagaggcgca 
gagagggcag 
taagatcgca 
tatgtctgga 
ttgggaaaat 
ccccttgggt 
tttctgcaga 
ggaggtgctg 
ctatttgcag 
tagtggaagc 
tggcggagcc 
gctgtgccgg 
gtcattgact 
acaaataat^ 
gcccatggta 
gaatgagtta 
acaagccaaa 
gtagtttgct 
ccccgccccc 
gggtgacctt 
ccatttccca 
ctgtagtcag 
^gcagcgcag 
cccaaaaggg 
agaaaacacg 
gtgatctttc 
tcctgcttca 
gctggaataa 
ctcagacctg 
gttcaagacc 
ttagctgggc 
aatcgctcga 
gcctgggcga 
tctgtatatc 
attatggact 
tttttttaaa 
aattcactag 
gggaagtggg 
tacaaagcat 
aagctgcttc 
atggatccac 
ccttggtgcc 
gcccgcagcc 
tgtccttgat 
aaagggctgc 
aggaagctga 
gagtt:acgt:c 
gggaggagtc 
ctagggaggg 
gcatcaaggg 
aggggccttt 
gagagctcgc 



gtggacgtgc 
tgtcagtc^g 
cctccagcca 
ctgaggacca 
ggatgcccgt 
aggccaccac 
cttttctttc 
gagagatcgt 
cagcaccctt 
ctctcaagag 
tigcatcagta 
tttgtgcatg 
gggtctgatg 
atatctgctc 
tgtcctggac 
cggcctcccc 
tggtggcgag 
tgtigatagac 
gaaggggcgc 
garcccacgt 
aaaaagatigt 
tccagctctg 
gattcaggac 
aaagtagagg 
tcaggcacgg 
attccactct 
ccacagttca 
gcagttctag 
ggagatgacc 
gggagggggc 
acatcaaggt 
tgtgtttgat 
t:ggcactgag 
tggactgagc 
tgaagagggt 
taatcccagc 
agcctggcca 
atggtggcgg 
acccgagagg 
tgagagcgaa 
ggggagatgt 
ggaagaatct 
aaagggaaga 
aggtgctggc 
ggagggaaga 
ctcttttcat 
ggagcagtga 
acgtgcgtct 
tgcaggctct 
agaaaccacg 
ggattcaggg 
cccgcgggac 
gcccgaaact 
cagggagagg 
tticcagggac 
gtacaticcag 
agaggggtgt 
gtgttggcga 
gaagaaccca 



12660 

12720 

127B0 

12840 

12900 

12960 

13020 

13080 

13140 

13200 

13260 

13320 

13380 

13440 

13S00 

13560 

13620 

13680 

13740 

13800 

13B60 

13920 

13980 

14040 

14100 

14160 

14220- 

14280 

14340 

14400 

14460 

14520 

14580 

14640 

14700 

14760 

14820 

14880 

1494 0 

15000 

15060 

15120 

15180 

15240 

15300 

15360 

15420 

15480 

15540 

15600 

15660 

15720 

15780 

15840 

15900 

15960 

16020 

16080 

16140 
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tctgattctt 
cacctgccac 
gcgacccttc 
gcctcctgta 
ccggttcaga 
ggctggtcct 
ttgggtgaaa 
taaaatgcag 
aacccgtigct 
aaaacaagag 
acctgactct 
catcg^ggga 
attcatttgc 
gtgatgaatc 
gggagctccc 
caggccctca 
atcgaaagaa 
ttttgttata 
caagatgtat 
ggaaaacatg 
ttaggtcatg 
gcctaggcca 
cccaccccga 
ttgctctggc 
ctgtgtctgt 
ctgaagaccc 
catgtgtgtt 
tgaagaagtc 
tgcggtgccg 
cacagcccac 
ctaccccact 
tggcactcct 
gcatcacctc 
t:caacat:aca 
ttagctgtgt 
tgggctttgt 
tggaagaagc 
ttctggtgag 
cctctgtgga 
tgaccacagt 
tccacgcgaa 
ccagcacttt 
tggccaacat 
tgcatgcctg 
aagtggaggt 
agact:ctgtc 
actccaaagc 
agccctttta 
actggaccat 
ctgcgcaatc 
aacatgtaaa 
gtgggccagt 
cctgactgtt 
gtgagccttg 
cccgtgtctc 
gcgagtggcc 
tctgcccctt 
cagggagcat 
aggtggggaa 



tccttctcca 
ctgcgacttt 
ctctctgcct 
cacctgggtc 
gcctggcacg 
cagtgtggtg 
tgggtaagac 
cacaaactgt 
aggaaaatga 
gttattggaa 
gtctccttga 
agccagagaa 
tgaggtcttc 
atgagcgatg 
ttgccccttc 
gtagacacgg 
at:acattt:ct 
acagctcgaa 
atacactgat: 
aacttgattt 
ggctgtcaaa 
aggggtgtga 
gcgctttgag 
gctgcttgtg 
gtctctctgt 
tcatgccttg 
tgtttcttct 
ctggagaggg 
gcagggactt 
agagaagtgt 
tgagccccat 
ctggggagca 
ctccaggatg 
c^g^gtatgc 
gcacggggcc 
gccagtgctg 
agacccaaaa 
gggagaagcc 
ggcaagttct 
tatcctgagc 
accataaatt 
gggaggctga 
gatgaaaccg 
taatcccagc 
tgcagtgagc 
tcaaaaaaaa 
tcctcccaca 
tgggctctct 
ttcagcaggg 
tctgccgtgt 
tgaatggtta 
gccaaccctt 
ttgagaggtg 
gcaggcagga 
cctcgccgtg 
ggaggcccag 
ccttcctggc 
ggggtgggag 

gcactacccg 



aacacatgcc 
cgagctgtcc 
ttgcacctgg 
atctgaaggc 
gtatggctct 
ttactgggaa 
tgcatgatgc 
agctgacgtt 
cttcccaaaa 
gcaacctaca 
tggatccagg 
caatccact:g 
acccccagtc 
gctttacggt 
caccctgtga 
agtctgcaac 
gatgtttgta 
cagactaagg 
tataatatcc 
gttggggttc 
cttgagtcct 
tgg^gggact 
ttgctgttga 
accggaaggc 
actcctggcc 
tcccctccat 
gtgrtaaagg 
ccaaggccgg 
cgggacgcgg 
cggtgcctga 
gttggtaggg 
gcgctcat.cc 
acctcctggc 
tgcccactct 
ctcagagcgg 
gggacatgat 
aacccaagta 
agcagaacgc 
caccacctca 
agtttcctgc 
aagaatttct 
ggcgggtgga 
tgtctccact 
tacttgggag 
cgagatcgca 
aaaaaaaaaa 
tccttttgcc 
gtgccctggg 
attctttttg 
tgctcagctt 
tggttgtatt 
ggttagctgg 
taaaggaccc 
gcaactcaag 
tggatggggt 
tagggcctag 
ctgggtgaca 
agggtcggag 
cagttggccg 



ccacccacac 
cagcctccct 
ccacatccca 
agtggccagg 
gggaaagccc 
ccatgcagcc 
ttttgtaaat 
ttgacccttg 
gaaaagaaac 
gacctgatgg 
agccaccaaa 
ctgtggactg 
ccatggtgtt 
ggaattagtg 
ggacacagcc 
actatgatct 
agccaccagc 
cattcactct 
aaaaaatgtg 
tgggcttcgg 
tggatctttc 
tggctctgtg 
accacctcac 
aaacagtggc 
tggatttgaa 
cctgacaagt 
gcctacaaag 
ggaccccaag 
ccccggcaca 
gatcggggtc 
tgcccatgtt 
cccttttgtc 
ttcctgcagc 
tgtctccctg 
tgaccattca 
gtggccctct 
gacaagacac 
tgagacggga 
ctctgtcccc 
cccctcttct 
ggctgggagc 
tcacctgagg 
acaaatagaa 
gctgagacag 
ccaccacgct 
atttctttcc 
caggcatttg 
cctcactttt 
caaagggcca 
tgctgtggag 
tcaataaaac 
cgtctctcca 
agtcatggtt 
gaagagaacc 
ggccacacct 
cctagtggac 
aagggaagtg 
aatctggagg 
gcgacacgga 



agccttacct 
ctgccctctc 
tggtctccca 
cagcacttgg 
tgggtgagtt 
tgtcccgttg 
tcaggtaact 
gtctaataat 
ggttcatggc 
agaggagggg 
atggtgagct 
aaagtttgtg 
aggaggtggg 
cccctataaa 
agaagacctg 
tggacttccc 
ttggtctatg 
taagaaggaa 
gtttttgttg 
gtacaggtgc 
tagagcccca 
cticgcctcct 
ccgcatagag 
tttccgggca 
agtgacaagc 
gacaaagtct 
ggagacatgg 
gcacagactg 
acaggcctgg 
aggagccagc 
cactgtgcca 
caactcacac 
tgcctgctca 
actggccttt 
cttgggcatc 
cttcatgggt 
agaaatacag 
ggtggcaggg 
.ttgcacaggg 
ccctaaagag 
ggtggctcac 
tcaggagttc 
aaattagcca 
gaaaatcact 
cccaactgag 
tgacctcaca 
gaggcggggg 
tccatctgta 
gatagtgaat 
cacgtaaaag 
tttatttgca 
gccctggctt 
ctgatttcca 
tgtaccagta 
tcctcaccgt 
atgcctggtg 
ggtgaaagga 
ctgactggtg 
tgaagaactc 



QtQaaQOCCt 


16200 


caaactCTaat 


16260 


ctggagtcct 


16320 


tactacaaaQ 


16360 


ggaccagaaa 


16440 


tcagccagcc 


16500 


tcttaggaag 


16560 


tcaactggag 


16620 


accaccgtag 


16660 




16740 




16800 




16860 


gcc& w wg^cig 


16920 




16980 




17040 


agtctccaga 


17100 


at ^ A A tat 


17160 




17220 




17280 


u wwg welw wclw 


17340 


a t A er A ei f A Q 


17400 


L> w w wa^y WW w 


17460 


w i> wy w wwa w w 


17520 


^ w ^ w w WW wy k> 


17580 


a«^r*> Anr^ jina t 
ay way wa^c&u 


17640 


yy w w ^y wy oi 


17700 


A A A t et t 
acicL WW Www i# w 


17760 


ay y wy ay yaw 


17820 


r'A eoacre t e 

wwawyay w ww 


178B0 


y ^yy wy waww 


17940 


yw^wwwwhvWW 


18000 


wwwctwwwwyy 


18060 


y wy wwaw w w w 


18120 


waww wyy w wy 


18180 


agccaagggc 


18240 


ciacaQnt t ao 

y a w ay y w w m ^ 


18300 


wy a www wy 


18360 


aa tee tot at 

y y w WW wy wy w 


18420 


ctcaagcaag 


18480 


gagaaaagct 


18540 


gcctgtaatc 


18600 


gtgaccagcc 


18660 


y awa wyy w^«j 


18720 


tgaacccagg 


18780 


caacagagca 


18840 


gccatattgt 


18900 


gagcatagta 


18960 


gaatgagaag 


19020 


attctggact 


19080 


cagcatggac 


19140 


aaaacaggca 


19200 


tctatggtcc 


19260 


tgcattgatg 


19320 


ccagtcggag 


19380 


gttttgagga 


19440 


tgaccccatt 


19500 


ggtgggctgg 


19560 


tctggcttgc 


19620 


aacagctgca 


19680 
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ccgctgtgga ctggctggtc ctcgccgcga agcagggccg tcgcgaggct gtgaagctgc 19740 

ttcgccggtg cttggcggac agaagaggtg ggtctgtgtg aggcttagaa cagcctctgg 19BO0 

agggttgagc agcttgtaat gctgcctgct aactgaacaa ccaaaatctt accaaaccta 19B60 

acgctggtga tgctgttggg aaatttcagt ttctgttttg ctggtggcct tctcatttta 19920 

gacactgttt ctggacttaa catgggatat tcaacagacc aagccatttt cattctcttt 19980 

ggcttgtgtt ggtatctcca agtggtaact atacatcctg cttccctgct gggttctgat 20040 

ccaaactgag acatcgatcc tgggttcagc tgagaggcag tttccctgag gcaaagctgg 20100 

agactgtaat gtggcctgca gtcagagtgg cccatgcccc tgagctgatg tccgtcgagt 20160 

ctgggcagac cagctgaggg gctgtgggcc cacacagacc tggacaagca tgttcacttt 20220 

gggaggaggc aggctgaccc ttcatagata atctctttca aacagagagg caggggagga 20280 

aagggacgca gctcaggcca ggggtggtga gttaggacgt tgtcctgcgc aagggaagtg 20340 

gctgtccctg gtgccgtggg gcagtgtgct tcctgaaggt tccacgcctc agggaaagat 20400 

gaaagtgggt gtcacaaagg tgttttttct aatgatacct tttttttcag attgtaagag 20460 

tattttatgc ttatcataga aaacttggac tatgtagaaa acataaagat acaaattact 20520 

cataaccttg aagcccagaa gcaactcccc atattttggt gtatttcctt ctgtgtttgt 20580 

tgttgttgtt ttgagatgga gtctcactct gtcgcccagg ctagagtgca ggggtgcgat 20640 

ctcaagtcac tgtaacctcc acctccctgg ttcaagcaat tctcctgcct cagccttttg 20700 

agtagctgag attacaggcg tgcattacca cacctagcta atttttgtat ttttagcaga 20760 

gacagggttt catcaggttg gccaggctgg ccacaaactc ctgacctcag gggatccacc 20620 

ctccttggcc tcccaaagtg atgagagtac aggcgtgagc caccatgccc agcttgtatc 20880 

atattttaag agttacattt attatacaga gtcttctctt tgaaaattaa actgatgatt 20940 

ttgaaaaact aacagtggca ttgttagcca gaatttctca ccaacccagc ctgctgattc 21000 

cagagcagtt ggaatgacct ggagggggct caggcggccc acacggctca ttgttgggtg 21060 

ccagggatag cccagagget gctggggcag agcgtgcaag ggaggaggcc ggttggcccc 21120 

gtgttccccc agctgcacac accttctgat gtgcagttgt gctttctgat ttgaaaagaa 21180 

atacatgctc atgtggaaaa ttttggaaat gttcaaaagg agaaaaacat ccagaccccc 21240 

gtgccagtcc tcctgccctc tgcttggttg tgccagtgtc ccaggccgtt gggggtaaag 21300 

ggaggaaggc agctgctgac cagcggtaaa acctgctctc cctctccccg ctgtccaagc 21360 

cttccttaga aagggctggg ctgtgtgtga gacagaggaa agctctgtag acacaggggt 21420 

ggccctgcca gtgcacccag gctggtgact gcagtgagtc caccctgggt tcaagacagc 21480 

tctgcctctt tctgactgtg tgaccttgag tgagccagaa cacaggcctg tgcctccggg 21540 

tctgcacctc cgagagaggg gaggaacagg aagaggc^gc cttcttcccg ccagcai:gra 21600 

gggggcactt gggggcgtct ggtgggagac cagtctggcc tcccagctgg agagtgggcg 21660 

tggcgcgatg tcctcttgag tcagatgtcc atgcatcctt ccctggtaac caagtcctga 21720 

caccttctat gagtctcgct cgaaagcctt ccaggcagag ttggcagggt cagagtggca 21780 

ccgaaagcct aggcagggca cacaaggcct ttgaccacat cctatccctc aggcatcacg 21840 

tccgagaacg aacgggaggt gaggcagctc tcci:ccgaga ccgacctgga gagggccgtg 21900 

cgcaaggcag ccctggtcat gtactggaag ctcaacccca agaagaagaa gcaggtggcc 21960 

gtggcggagc tgctggagaa tgtcggccag gtcaacgagc acggtgcgag gattcaccct 22020 

gggcaccagc cttccctggg cgccagcctt cccacaggag ccaggacctt cccatagggg 22080 

ctigggacctit: ccctcagggg ctgggccttc ccacaggagc cgggaccttc cctgtgagga 2214 0 

cagggccctt ccttgtgggg accaggggac cagaaccttc ctgtagagac cgtgccctag 22200 

cggtgaggtg tgtgggtggc attttgacag catctgccct ggctcaagtg ctcactcatt 22260 

gaataaacca gagggtattc tgcccagtgc tctgtgacca cgtctaccaa tgggacggac 22320 

tgtgtccatc accaagtggg agcacgctac gtggtgctga gtccacccca gctactggag 22380 

gtacagaggt gtggcccctg ctctgcctgc cctgggggcc ctatgatccc cagaacgtag 22440 

gatgcccctg gaactggcgt gccctaggaa cagtgcgcca gtttctggtg ggctgcaggg 22500 

cacgaggaga tagtcaactt gtctgactgt taatccaccc tgtcccctgc agatggaggg 22S60 

gcgcagccag gccccgtgcc caagtccctg cagaagcaga ggcggatgct ggagcgcctg 22620 

gtcagcagcg agtgtgagtg cagcccctgc cccgtctcac ccatgcctcc cagcctgcac 22680 

ctgcagggcg acctctcctt cctgtgcgac tccatcccgg cctgccctat ctcacccgtg 22740 

cctcccggcc tgcgcctgca gggcgacctc tccttcctgt gcgaccccat cctggccctg 22800 

ctaggatctc aggcggtccg tttggggctc agtgttctgg acgctgggag tagaccctgc 22860 

ccacctggag cgcacgcact ggagggaagg cagacccagg acagaaacca tgatgtgcca 22920 

gtccctcttg gacaaggaaa tactgggggt ggggactggc gggggggtcc tgagtgggga 22980 

aggatgagga gggcattagg ggaaggggct ccgggcagag ggaacagcat gagcaaaggc 23040 

ccaggagacc cagcagggtg tggaaggggc tctgccccag cgtaggccct ttgtaggtag 23100 

gcagggaaca gagtgggcag agcctaggtt gggggataga gtctctgggg cacactgagc 23160 

tgggggtgcc tctgggatgt ttgtggtcaa agactgagaa gagacgtcac atcctccacg 23220 
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tgcagctggc 
ctgggagctc 
ggggggatgg 
atttttcttt 
tgaaggacct 
cacagatccc 
cctcgcctct 
ggagacaaag 
ggaccagaga 
agtgaggtgc 
ggcagagtct 
acacctgctg 
ctgtgtcccc 
tatcagaacg 
cagaaagggg 
ccacgttatt 
tcttgtcctt 
tagagccctc 
gggtccctag 
ctatgaggac 
gtagggagan 
ccgcaagccc 
cctggctgtc 
acctgccact 
ggtgctgccc 
cgcccactat 
caccctggct 
ctggaggccc 
ccctcctagg 
gttggatgga 
ggtaagtcgg 
cccatggcag 
catgttggat 

gaggggttgg 

gcggcttgtc 
gcactggagg 
ggccccaggc 
ggggccgggg 
cagcctggtc 
agctcctttc 
ggcagtgggg 
tacatcgcgc 
agcagcctgt 
gacctgccac 
ccaaggactc 
cgcctaacaa 
cataaggatg 
aggggcggtg 
gcagtgtagg 
gcagatttct 
aaacaaagtg 
atctgcaagg 
ggtgccattg 
gcggggagcg 
ggccatgtgg 
tagatgtgtg 
gacccaggat 
gctgcgggca 
ggaggaactg 



cagcctcact 
ctgtcccgga 
gccaagaccc 
ctgggtgtca 
cgcccatcct 
accttcccac 
gcgtgggcgg 
aaacacctca 
ggctgtgaca 
ctctgggcag 
ggaccccaac 
ctgggcctgt 
tccgccaggc 
ttcccatcag 
cgtctgtcat 
ccaccaaagc 
gtctgacttg 
tcccccacca 
actgatgtcc 
aggggccttc 
gtgtgggtgg 
ccagcccgcc 
ccatggtcac 
tgtgtgggcc 
cgactcctct 
cacggctgcc 
tgtctccctc 
tcctggtcac 
ggatcatgtt 
gggatcagtg 
cccatggcag 
acacttgttg 
ggagcggCtt 
gggcgcagat 
acccgtgctg 
tgcatgttgt 
ataaggagct 
gccaggagtg 
cccaaccctc 
ttagcttggc 
ctgcagtgtg 
tggatgactt 
tcctgcagga 
tgcgt:ct:gaa 
gcgcacctca 
agagtgtctt 
tgtcctcggg 
ttgggcagga 
gggcagtggg 
ccatgtatgg 
ccgtgcccgc 
gctcagggag 
ggcacacctg 
ggaagcccag 
ccctatggcc 
cctgtctcac 
ccccaagctg 
gtiatcatgca 
ccatcattgt 



gtcttccatc 
gaagatagac 
aggtctgcca 
gtttcccctt 
agaagcctct 
tgggcccgrg 
acatgggtgc 
catggaagtg 
gcgtcctgcc 
gagcagccca 
ctcacttccc 
cgctgttgct 
cctcagggac 
gcatcatctg 
ccccttctct 
ggctcgtcta 
atctgtcagc 
ggcccggggc 
tgtctctcag 
cttgtgggga 
cattggtgac 
caggctgcat 
ctcacagctc 
cccccaaggc 
cctcttgcac 
atcttggatg 
tgccctgccc 
catctcacca 
ggatagaggg 
ggcacgtagc 
gacgcatgtt 
gatggaggtg 
agcaggtgca 
catgttcgat 
tgagaagccc 
aaggcttggc 
aggcagagag 
gaggctggca 
aggccgccca 
cccacgccac 
gggcgcccat 
tg^ggagatc 
cgacgaagat 
ggtgagtgac 
ggcagggcac 
acagccgtgc 
agagaggccc 
gtgccaggat 
gcgggccgga 
tccagagtgg 
caggccaggg 
cgaggccagt 
gaggacatga 
cagtgcgggc 
tctcacagag 
tctgcagaga 
aggccagcac 
ggcaccctgg 
attcagatgg 



cacgtggggc 
aggcaggagg 
cagactcttc 
ctgaacagtg 
gtggcatgta 
aggaaaccgt 
ttgcgctgct 
ggaagaaatc 
accgaggtgg 
gcctgggtct 
agtcccacag 
gggggcgagt 
ggggccccag 
tttctctgcc 
cgtcctccgt 
ggccaccagt 
agcgtgggca 
ccttgctgcc 
gacaggacat 
ccagggtctt 
cactctcgtg 
ccagcccgtg 
agtctacagt 
tgcctcatct 
acacagctga 
cagcctgcct 
cagcatggtc 
cggcaactgg 
gtcagtgggc 
aggctcttgg 
gaatggacgg 
tcgcaggcac 
cagtaggccc 
ggagcggttg 
cacgctggct 
aggtgaaccg 
ggacacctgg 
cttggcaaac 
gggaagggtt 
cgtccccagc 
gctgttttct 
accaagaagt 
gatgacgagc 
caagaccccg 
cttccaggaa 
cgctggtacc 
ttcttttctg 
ggcaagcaag 
cgcagaattt 
aggccagtgc 
aggacgtgca 
gtgggagcgt 
gcagctttgg 
cctactcctg 
ttgctcacag 
ccaccaggat 
agggggcccc 
ggttctgtct 
gcgcgtgagt 



agactctgca 
cttgggggct 
ttgtgacatg 
agatgatcag 
gggcagagta 
ccatagggtg 
gttttcctgc 
tgcgactcta 
catggtgcct 
ccatcctggg 
gggtttgtga 
taggagtcag 
cacccccatt 
agcttttcca 
cctttgctct 
gacccccctg 
gagatgactg 
acactggctg 
cctgcccaag 
cccgtiaggga 
tgtcagacgc 
ctccttccac 
gagccactga 
ctttgctgca 
tccattagga 
cctccctagg 
ctttctttcc 
ggtccttcag 
acgtagctct 
tggaagatgt 
gttagcaggc 
atagtaggtg 
tgggtggatc 
gcagggtgct 
gaagaggagg 
ggtgaaaggt 
ggtgggggac 
ctgccccctt 
tcctccacct. 
ccatngctct 
ctcatgcttc 
acgccaaggg 
tggcggggaa 
gtcaggccgg 
gctgcaggtg 
tttgggtcat 
cgccgtcagg 
gggcccctgg 
catctagact 
cttgaagatg 
gaggcgccca 
ggtgggtgac 
cggcaggagg 
gcctgacagc 
ttcagctcgc 
ttccagaagc 
agcagatgat 
tctgaactga 
cagggccagg 



gtctgaggca 
cagtatcaat 
ggcaggtctc 
acaaggggac 
caggtgggag 
tggtcagagc 
tccatttggg 
ggaagcccga 
gctgtcctga 
gggcactgca 
tgagccgggc 
cgctgccagc 
gttacccctc 
acagaactcc 
caggcctctg 
cccagccact: 
ctcttctcct 
ctccttcttg 
gtattcatcc 
ctgtgctctg 
cgtcctgaac 
atctcctgta 
ggcctgtgct 
gctacgcccg 
ggtctgtcag 
gtttctgcag 
catccttccc 
aggcctcatt 
cgcggatcac 
tggatggagg 
atgtagtagg 
tgtggcagat 
acgtcgaatg 
gccctacagg 
gtgtcagcat 
gtgggtgagt 
agcacaccca 
cctcctcacc 
gaacccactic 
gtgtgagggt 
agccaagaac 
cgtcatcccc 
gagccctgag 
agcctgcctc 
gggaggttcg 
catctatcgt 
ctcagaaacc 
gtctttctgt 
tcatcacttia 
ccttccccga 
ctgcccctgg 
acttgtgcag 
aaagcgggga 
ttgttttctt 
atggttgaat 
ttggcctcct 
tgtcacagag 
ctgagaacct 
accaggagat 



23260 
23340 
23400 
23460 
23S20 
235B0 
23640 
23700 
23760 
23820 
23860 
23940 
24000 
24060 
24120 
24160 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
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ggtcgggcca 
gagaaggtca 
aggagcagct 
cagacaggtc 
ggtggaaggc 
cggtctgcag 
catggttttt 
gtgccatcat 
aaagacaccc 
cttgggccag 
tataataatg 
tggtgagtct 
gggtctcgtt 
caacgtgcca 
atgtgccacc 
attgttatct 
acactgctgg 
agagatttaa 
gccgtcagag 
gcaatgaatc 
ttgtagaaca 
acccttactc 
tgcaagctgt: 
catttttaac 
agcagaatct 
tcggtgtggt 
cccagacaga 
tttcgttctt 
actgaacttt 
attggggctt 
ctccacactg 
gtcttttaaa 
ggctttgagt 
ggcggtcgtg 
gctgttcgca 
tgactcgtgt 
ccaaacaaac 
tggaaatgcc 
gcgcctctct: 
cgctttaatc 
catgaagcag 
tccgtttctt 
catgcacaca 
tggaagactc 
gctgttggca 
ccctccttgg 
ctggtgcttc 
ccctcagtgg 
aggcgtccac 
gctttgtatg 
ccctccagtc 
tgcacagacc 
tgaggctgct 
aagagctctt 
gggaaggcag 
ggtgcacacg 
cacgtgtgtg 
tgtgtgtgtg 
aargcaggta 



ccaactgtca 
gagtaggccc 
tcagcccaaa 
cgagtgtgag 
tctgctcctc 
aggtcctgag 
actgatggtc 
ctccttacta 
ccgaaatgag 
gcggccatgg 
cctccttgca 
agtgttgtgt 
ctgttgcaca 
aactcaagtg 
acacccagct 
acgctggtct 
gattcaggcg 
gctgttttga 
cactccgatt 
gtcacaagcg 
aggagaggtc 
ccacaaagaa 
catcgtagat 
attacataat 
cgtctcggct 
gacgctgctt 
gagctctgca 
tttccatctc 
ttaatcccat 
taaaaagggg 
tccaagcaga 
ggccctctca 
aagacaggat 
cgtgaaactg 
aggcccctgt 
agctgttgat 
cctggagtag 
ctcaaaccct 
gctcatgtag 
gtttatttgc 
cagagctgtt 
ccatgagcct 
ctcataacac 
agtcttctgg 
ggtgagggcc 
ccttctggtg 
cccgaggccc 
gccctggaca 
ttcctggtgt 
ccacgccatg 
cctctgcccg 
aggaaggcat 
cccagcatgc 
tctgacctta 
cctttggcct 
tgtaggggtg 
taggggtggg 
taggggtggg 
ggttgcgtgt 



ggcaggcagc 
gccctccggt 
gagcagcagg 
ccccccgagt 
ccacttggga 
acacgggcca 
tgctcagagc 
atgggagatg 
agcgagggtc 
cccaggtgtc 
tatgcgtcct 
agcatctgaa 
ggctggagtg 
atcctcccac 
gatttttttg 
caaactcctg 
tgagacactg 
caagaagtaa 
cctttgcagt 
tcccccaggg 
ttcactcaaa 
tgttccagac 
agaatcccct: 
tggaagatcc 
cccttcccca 
cctggtttct 
gtgttctatt 
aggagttgaa 
gtcagaggaa 
atcgttccct 
ggctagacgc 
accaggtggt 
tt-cagatcca 
gtggtgagca 
acggggatita 
tttagccagc 
gttggtttcc 
gtagtgcctg 
aaggttgttg 
rcgrtataaa 
cccccacgtg 
gtccttctca 
ttccgtttcc 
ttgtttccag 
acattctggt: 
ggcctggtgg 
ggagctgatg 
tgcagccagg 
tggtccaggc 
ctgtgtgagc 
gctgccctgt 
gagggctggc 
ctccctcagc 
gcactgcatg 
tgactatggg 
ggtgtgcatg 
ttgcgtgtgt 
ttgcgtgtgt 
gtgtgtgcag 



aggctgaaag 

gggggggaag 

cacagcagcc 
gcccagagct 
caggctgtat 
ggacggccaa 
tggccccagg 
aggaagctgg 
acaggtgaac 
ccagtttatg 
gttcggaaaa 
aatcccattt 
cagcggcacg 
ctcagcctcc 
aaaacatctt 
ggtcaagcga 
tatccagcct 
acaagactct 
ttgccttctt 
ctgagagctg 
gtgaaaatga 
tccacacagt 
gaaaacaaac 
gacacgggaa 
gcctgccttc 
catggcagca 
catagaggca 
tttgtttgga 
ggtcttgggt 
ggtcatcgag 
catccaacgt 
tgtgtgtgtt 
ttt^t.accaa 
gtggcgggcc 
actgagcagc 
tggcctggcg 
atraaargccc 
gtcttccctg 
accccagaga 
acagagatita 
tagacgtgca 
tctgtggaag 
tccatgagcc 
gtgtcctggt 
gt:t:gt:caccr 
agagaacctg 

gggggctcca 

cttccctacc 
ccctggcact 
actiggcacca 
gttctggcag 
cctgggtggc 
cctctgctcc 
cagagcaggg 
gaggtagggg 
tgtgagggtg 
gtgaatgtgH 
gtgaatgtgt 
gggtgggttg 



gtcaccgtcc 
agcaggaacc 
ctgctgcagg 
gcgagtcctt 
gtccagggta 
gaacttgaac 
caagcatatt 
ggtgctggca 
gggttaactt 
ccatctggtg 
ttatgtcacc 
atttatgtta 
gtcacagctc 
caaatagctg 
gtagcgatgg 
gcctcccact 
gaaagtcctt 
gtttcccttt 
agaagtctct 
Ccccaccttt 
cagacacctc 
tctcttctgg 
rtaatatctt 
catttttgcc 
ttcacgggtc 
gaagggccga 
gactcctcat 
ccttctttgc 
gtacattgca 
ttcaggtggg 
aatgttctag 
ctaggcaaaa 
ggccaccccc 
aggccacctt 
ctgccggctg 
tcatactaca 
tctggctctc 
gccatctcct 
gagctagggg 
agrggactgt 
tgcacacact 
gctcagtctg 
tgcccctctc 
cctccagagc 
agttgccatt 
ccccagggga 
gcctggatgt 
cccatgtggg 
tctgaagatg 
tggccacttt 
ggctcagggc 
agcgctgctt 
tcttggctgt 
gagcaggacc 
tgcactggtg 
cacgtgtggg 
ataggggtgg 
gtgtaggggt 
cgtgtgtgtg 



ccgagccagg 
caggtgctgc 
gactcggagc 
tagggtggca 
cgacaccgac 
caagggagca 
cacagatcgg 
cggttttggc 
ggagaagacc 
taagcactat 
cagtitatgga 
cttaargaca 
actgcagcct 
gactacaggc 
gggtctcact 
ttggcctccc 
tctagaaggg 
agttttggat 
tgtgagggag 
tagragcctt 
tttgtgagcc 
agcctggagg 
aaaatcgtaa 
cacaggtttg 
cgctgggcca 
tcaccacagg 
agtttttaag 
agaaaaggag 
gggagcagaa 
ggaccagccc 
aatat.tcctr 
gaatgtgcat 
ttgctaccgt 
tagttgtcct 
acgcacatgt 
ggggcttgcc 
tggcaagcgt 
ggcagtgtga 
agctgttagc 
cacaaatara 
cctaaacact 
tgtagacatg 
ctctcatctg 
tgatagccag 
cacctgggat 
tttgtggggt 
gtcatgtagc 
gccgccccct 
cacagaacct 
gtcccccatc 
ctctgagttc 
caccccatcc 
ggtgaaggcc 
cacaggtcgg 
tgtatgtgta 
ggtgggtgtg 
gttgtgtgaa 
gggttgtgtg 
aatgtgtgtg 



26820 
268B0 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
3 0120 
30180 
30240 
30300 
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taggggtggg 
gtgtgaacgc 
tgtgtgtagg 
gggtgggttg 
gttgtgtgtg 
tgrgtgtgtg 
ctccggccat 
tgggagagtg 
cgctctgcca 
cccttcatgg 
ccagctccca 
gagggaagtg 
cacatgcatc 
ccaggcccag 
aggggggagg 
ccttttgccc 
gtcccagcct 
catcatggag 
gtccaccatc 
cctcaccatc 
ctccatggtg 
caccctcacc 
taacttcggc 
caccttctcc 
caccggcrtc 
gcacagggcc 
cttgaattgg 
cctggtcgtc 
cttccgcatg 
ggtgtgcttc 
gctgctccgc 
cggcctggcc 
caagatcgca 
ggccagcttc 
catcctggtg 
gggcatgaag 
acgcgcccgg 
ccacagggtc 
cgccgagncc 
cggcgaggcc 
tcgccttaag 
tgagattacc 
ggacgacgcc 
cagcctgcgc 
caagtggcct 
gcccaccagg 
gaagttcgcc 
cacgaggagc 
caggcatgca 
gaccatgtgc 
tctgaacacc 
aatgacatgg 
gatgagattc 
gttggatttg 
ttgtttggtc 
gcttgagcct 
tggcagtgcg 
ggtagtgggt 
ccccgtgggg 



ttgtttgcgg 
aggcaggttg 
ggtggggttg 
tgtgaatgcg 
tgtgtatggg 
tgcacgtgtg 
agggagggca 
gtgctgggcc 
cccctccggg 
aggctcaggg 
gacaggcatc 
gggagcccgt 
tagtcacgct 
aagagggagg 
gaggaccact 
agaggcaggg 
cgttcccacg 
atcaaggagt 
atccccacgc 
gacttcttcg 
atctgcaccc 
gacctgccgc 
tggaaccacc 
ttccccatcg 
tttaccgtga 
ctggccaccg 
ccctacctga 
ctcaatgtca 
gcacagc&ga 
atgtggtgtg 
gcctccatcg 
ccggtgggcg 
gtcaccgtgg 
tctgtggtgg 
tggctcacgg 
gcctacaact 
aaggagacca 
acgtggaccg 
gccatcaaca 
taccctgcct 
ctgctggcca 
gtgggcatgc 
accaaggaca 
cagggcagcc 
gtcttcgagc 
cggcacgtga 
ttcgacttct 
ttccagtgca 
ccagtgccat 
agattgcatg 
aagtgtgttg 
gtgtgccagg 
cctctccccc 
tttaaaaacc 
actgctiacac 
gtcacgtggt 
tcacactgag 
gaatgtgtga 
gtggctgtgg 



gtaggttgcg 
cccgtgtgtig 
ttcgaacgcg 
tgtgtgtagg 
gtgggtttgt 
tgtacgggca 
gctcacccgg 
ctctgcctgt 
gactgcctgc 
aggtgacatc 
tcagggctcc 
gggtccctcc 
ggtagaaggt 
gctcacaggg 
aggatggggc 
tggtcagagg 
taccatcttt 
acctgattga 
accacatcaa 
ccttcttcat 
tcaaggtgtt 
tgcgcttcga 
tggagcccta 
ccagcaagga 
ccagctacct 
aggtcaccgc 
aggc cc ttgg 
gcgtcccgtg 
ggaatttcaa 
agctctccgt 
gctacttcct 
tgctgcagtt 
cggtctgtag 
ggatggtgaa 
ccatcgtgct 
ccacactgac 
acatggcgcg 
gccgcttcaa 
tgctcccgtt 
gcagccctgg 
agcacccctg 
cattcagcag 
tcgtgctgcg 
tcategagtt 
tcaaggccat 
agatcgagca 
ttttcttccc 
tgttgccatg 
ctgtgcccac 
gaccccgaca 
ggaattgcat 
ctggactagg 
cccaccttca 
aaataagcat 
cttagcagct 
c a aggc c egg 
ctgggcacca 
aggtcttgcc 
gggaggtccc 



tgtgtgtgaa 
tgtaggggtg 
ggtaggttgc 
ggtgggctgc 
gtgaatgcgt 
gaaggtggcc 
cagccccgtg 
gtctgcactg 
ggcttctcct 
cttgctgtag 
gtcatctctt 
agtgctggag 
ggggagccag 
actgcgagca 
tggtgatggg 
gaggcgtgag 
cccccaggtg 
catggcctcc 
cgcgctcatc 
cccgctggtc 
ccaggacagc 
gcccaacctg 
tgcccatttc 
ctgcatcccc 
gage ct gage 
cggcctgcta 
ccagaccttc 
cctgctctat 
gggcacctac 
ggtcatcctg 
cttcctcttt 
cgcccggtgg 
tgtgcccctg 
gtccctgacg 
gttctgctgg 
ctggcagcag 
cacccagatc 
gtacgtccgc 
cttcatcggc 
caacacctcc 
ccacatcaag 
cggcgctgac 
ggccagcagc 
cagcaecate 
cagctgcctc 
cgactggcgc 
attectgtcg 
aggc cct tec 
gtgtgeagac 
aagggaaggc 
gccatctcca 
aggttccggt 
ageaccctgt 
ctgtgtaacc 
cttccccttt 
ccccatcaga 
caggctgcct 
tgaatccatc 
tgccagtgtt 



tgtgtgtgta 
ggttgccgtg 
gtgtgtgtga 
atgtgtgtgt 
gtgtgtaggg 
tggeagctge 
cagctgaacg 
aacccccttt 
cetcgtaagg 
tgcagagctg 
catgtgggtg 
gtcttgcagg 
gcacggggea 
tggggagggc 
aaaacgcaag 
atgggagcag 
gtcaagtacc 
agggcaggca 
ttcttcttca 
atcttctacc 
aaggcctggg 
gatgtggage 
ctgctctctg 
tgctcggagc 
acccatgcag 
tcgctgctgc 
atcaccgtge 
gtetacctgc 
tgetacettg 
ctggagtcca 
gcceteccca 
ttcacgtctc 
ctgttgcgct 
cggagctcca 
ttctatgtgt 
tatggtgcgc 
ctctgcagcc 
gtgactgaca 
gactggatgc 
acggccgagg 
aagttcgacc 
ggctcgcgca 
gagttcaaaa 
ctggagggcc 
aaetgcatgg 
ageaccgtgc 
gcggcctgag 
ceagtgtggc 
tgtggctgca 
tgetgtgtag 
ceetgagcct 
gtctggaaaa 
tccctctttc 
tccacagtag 
cctgggggat 
ggetggggga 
catgaccctc 
aggacttggg 
tagaagagce 



ggggtgggtt 
tgtgtgaatg 
atgtgtatag 
gtaggggtgg 
gtgggttgcg 
ttctcagggt 
gtatttttgc 
ccctaatgct 
atcgccattt 
ggteacaaaa 
aecetgaggg 
gagagaagca 
gaggggggct 
cacgtggaga 
ggtgcgggtt 

tgggggtcct 
ccctgcaegc 
tgcactggct 
tcatcagcaa 
tgtccttcat 
agaacttccg 
aggeegaggt 
tcttcttcgt 
tggctgtcat 
agecctacac 
cctccatgcc 
ctgtcggcca 
tccatctctt 
tgceetacct 
ccggcctggg 
tcctggtggc 
tggagetcae 
ggtggaccaa 
tggtcaagct 
acegctcaga 
tgtgcgggcc 
acctggaggg 
tcgacaacag 
gctgcctcta 
aggagctctg 
gctacaagtt 
gccgcgagga 
gcgtgctgct 
gcctgggcag 
cccagctctc 
atggcgccgt 
gatggtccgc 
cccagcccga 
gagaccttgc 
ctctgtccac 
gacctttctg 
gcactttaca 
tttcttttgt 
catttcttat 
gtgcacggca 
ggcggcacat 
ctctccagca 
aaacagagaa 
tgactgtgtt 



30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
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cagtgcctitg 
cctgcagcgc 
atcagcagag 
caaggttgac 
agctggaggg 
tgtgcagaga 
ctactgagcg 
agcctacagg 
cacctcctgt 
catttggggg 
ctgtctgtcc 
acctgctcct 
tgctgtgggt 
tgcaggatgt 
cactgtcctc 
ctcaggtggt 
ctgcgtgaaa 
ctgggtttct 
ttctgtggat 
cactgactgc 
agcatctccc 
acggcaccga 
gcagagcagg 
cagcatccac 
ctgtcttttg 
acccctgtgc 
aaaacggaaa 
tttgtttgtt 
ctcggctcac 
gccgaccacc 
ccgtcccggc 
gagggaccgg 
gtcaggaaag 
agtttccaag 
gtccattctt 
taaagaccag 
catcatgcaa 
gaaccaggac 
tcccaatcag 
tctcatccgg 
tgcttcctct 
tgtgtgcttc 
tgggaggggt 
aggtagagaa 
ccacattaat 
ggctgttgca 
aaaggtgtcc 
gatgttgcag 
ctacccagaa 
gggtgtgaga 
caaggaacgt 
tccagagaga 
attcccatgc 
aaacctttgg 
gtctgtccct 
gcacccaggt 
gagtcactca 
ccacaccttg 
agtgactttg 



gagcagaaag 
tttccttcct 
cctgcacccc 
gtttcacagt 
gcccctgggg 
gacctgtgct 
gaa^gtgcac 
gtgattgatg 
gttccctaac 
cacccgagca 
cagccccttg 
gtcccccgta 
gggaagcgtc 
ggttgctcca 
tcatcctggt 
gtcatcactc 
ccacacgaga 
ttcccaggca 
ttgtigaagcc 
agaggaggga 
tcacaacacc 
ggttcagagt 
aaggacagag 
aattagtttc 
ggcagagtcc 
cacccggcac 
gcctgggcta 
tgttcgaaat 
tgcagcctcc 
ccacccagga 
cacctgctct 
gggcagggtc 
accgaattgg 
ccctctggaa 
gggtagcgaa 
tcattcaaag 
gtacactccc 
ct:gt:c^ct:ca 
caggatgggg 
cagctcgggg 
tcatgacaaa 
actcttcctg 
gtttgtgggg 
aagtgctaca 
ggagcaccca 
aaatagacac 
aggtcctaat 
atgggattca 
gggcccaaag 
ttggaagatg 
gggtgtctct 
cacagctgac 
ccccacccct 
gctcctgctc 
ct:ct:gaggct 
aaagataaag 
taactcgatc 
gaggctggca 
gaaagtagct 



ccagggccct 
ttctttaccg 
tgttgaatga 
tgatggcttc 
catctgccgt 
gaagg&gacc 
ggccataatt 
gcgtgggggc 
ctgcagtgat 
gctgcaggcc 
ggtctttggt 
ggctgggctg 
gagtggtcac 
ttttccttgg 
ttgtcccgcc 
cgagccacgg 
ggctcaggag 
atgagatgag 
atggaaggcc 
agtcgcctga 
tggatgagga 
ggaggccata 
ctctgcctgt 
t:aaccaaccg 
agaatgggca 
cattitaacac 
agtt'agtggt 
agggtctcac 
acatttgttc 
aagaacctgc 
ggtctgcccc 
tcactttgct 
gtggtgccat 
gcccagctcc 
gccatcagaa 
cticaaticatt 
aagtgaggtg 
agtctgccac 
tgtcagggct 
aaagcacact 
gaaaacgaac 
gcacagggga 
agatggacca 
gtcatctacc 
ctatatgctg 
gtccattgtg 
ccccaaatcc 
gttaaagatc 
taatttcaag 
ccaccccgca 
cacgccggaa 
agagtcagac 
ctgcatggct 
cctgcctgtt 
cat:gatgrca 
gtgctttgtt 
aattatttgg 
aaccattigCt 
tggggaacaa 



gagtggctga 
aaaagaagtc 
aatgcaagtg 
ctgccacagc 
aactgcgggg 
atggagtgtc 
atctttgaat 
tgctgtccat 
gaccccatgg 
cctcagcaaa 
gtcttccctg 
ctctctgctg 
acgatctggc 
acgacactgg 
tggaggggct 
gtggtggaag 
gtgtcatcac 
gttggggttc 
ctcgccctgt 
tgctttccaa 
agatgctcca 
gcacattagc 
ccccttcccc 
gcagaggcac 
tggtcccggg 
ctgtccccag 
tttcaaaact 
tttgtcaccc 
tgggctttac 
cgtctgcagg 
ttcaccgtat 
tctacgaagt: 
ctcaagtgcc 
ctaggggcat 
agacctctct 
caagtigccct 
acaaaaacgc 
ctgaggacag 
att:aaacaaa 
ttgacagcca 
actcattcac 
tgcagaatga 
ggtaccagtc 
gtgagcgctg 
gacacatacc 
aagactgcgg 
tgtttgtgtg 
ttgataccgg 
tgtccttgta 
agcttcaaga 
aaggrgagga 
tgggccactg 
cctgagcccc 
atccctgaat 
tctgcgtggg 
gtttttctaa 
agctctaaaa 
aaagttgttc 
aacaaagcaa 



aataaaagcc 
tttcttgtac 
caatttgagt 
gagaccctgg 
tggcctgggc 
agcccagcca 
tccagttacc 
gcgggagtgt 
gattataatc 
cctcacccct 
aggttgagga 
ggcaccaggc 
ttggccacct 
gcggtggcag 
gcgtgaaacc 
ctgcaccccc 
tccgagccac 
agcctcattt 
gactctgccc 
gtgaacaaag 
acatccccca 
acgttatcac 
tgggccacca 
aggttggagg 
ctccactcat 
gcctcagtgt 
tcftagcaac 
aggctggagt 
agagaggctc 
aacagccatt 
ggagccgtag 
agtagcattt 
aagagagggt 
tctttcaggg 
gcctcctttg 
cttccttttt 
acatttctgc 
aatcctgtaa 
cagccacgtg 
ggaccctacc 
tcaaaatatt 
acagagagcc 
agtgaatata 
tgatgctctg 
atgcattttc 
gcggtagaat 
ttcccttgca 
ggagatgatt 
agagggaggc 
tgggggaagg 
garggttctc 
acagggctct 
tctctgaggt 
gttgctgcct 
ccccccgcgt 
aaaggacttc 
gcttcccagt 
ctaaagtgct 
acaaaacaga 



tccggtggaa 
gtgcgtgaga 
tataaaagag 
cttcatctcc 
atgggctgcc 
tcttcagatc 
catgaagtgg 
gtcgcgaaac 
agaacttagt 
cttgggcagc 
cgtggcctgt 
ctcacctctt 
ggtgttgcca 
ctacacccga 
acacgagagg 
acactgtcct 
atgctcgccc 
cctgtcccca 
ccaggatccc 
atgacaacac 
tctacagaag 
atggcacata 
cactgccact 
agaagtggtg 
tc^gccacaa 
cccaatctgt 
accgttttgg 
gtagtgctat 
cccgggcccc 
atticaacacc 
gacacccagg 
gagagaatga 
ttggcgccag 
ggccttcact 
gacatigctgt 
tttgatccta 
agatgcccat 
ccaccccagc 
gcccaacacg 
cgacaaatgg 
cagcacctgc 
cctgccccac 
gcacaatiggc 
cccagtttca 
tcatcctagc 
cgcagccccc 
tggcaggagg 
ctggactttc 
tggagggtca 
ggccatgagc 
gcctgcagcc 
agcgticccag 
gtctcctgga 
tttgcagctt 
aattctaaca 
gctgttgctt 
tgctaggaaa 
gaacttctac 
aaagactcgg 



33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35560 
35640 
35700 
35760 
35820 
35680 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
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gttccctttt ctttccgtcc agaagtgatc aaatcagcta aactcactgt ttcaagcttt 37440 
cctctaacta gatgtgtttt ttcctgccgt cagccagaaa ggccctttcc tccctatttt ^ 37500 
agtcacagaa taacctgtct ctttaggaca gtggggtcct gtgctgaacc ccacccccaa ' 37S60 

agatgggaag aactcgggcg ctgcgtttcg ggaagccccg gctgagatga gtgtataact 37620 

ggggctgaga gccgagagcc cggcccagca agccaggcag tgaggtgacc cggtatcccg 37680 

ctccccattt ctgctggagc tggaagagcc ccaaggacac cagccccatc agagggagct 37740 

gggcccagag ctgggtgggg cagccgacta ctccctgcag aacacctacc gcctgcctga 37800 

cactggcggc tgctttgtgg acatgctttt gaaaacctcg tgagggtgat gttattgtcc 37860 

ccactttatc caggggtacc tgggcacgga gcagggcggt aactcaccct ggatgtctgt 37920 

ggcagcctcc aagatggtcc cagcaatccc caccaccttc ctgccgctcc ttccatgtag 37980 

caggataggt ctgtgcgacc agtagaatat ggcagatgtg acgagatgtc acttctgagg 38040 

tttgcttaca aaaggctgtg gctttcagtt tggctgctct ctctcggatc catcactctg 3 8100 

ggtgaggaag ccatgccgcg agcagccccg tggggaagcc tgcacctcaa ggacctggag 38160 

cctcaagcca actgccacat ggtgcacctg gaaacaggga tggctgccac ggcccacact 38220 

cgcctgcggc tccggggaga cacccagcca gaaccccctg gtgaatccac tcctggattc 38280 

cctaccccca cagagtataa gatgacgaat gtggcl:ttag gct:gttaaat tcgggataac 38340 

cgatacagtg ccaacactgg taagtgacga ggccgcaatg aatcccaggt atcctgtggc 38400 

tgcatctcca cctgcatggg ccatcctgta cctggcaggc cccatcctta gcctctttaa 38460 

aagtccagga accttggaat gtttttctga catctcctag gctgctggat ttaccaggaa 38520 

aagggacgga cccacacctt ttccctgttt gaatccattg catttcttcc aggtgtttca 38580 

atcccttctg actcgatttg ctttaatgaa gcttgaggca agtgcaattt tccgcccatg 3864 0 

gggctgtaac actgttcaaa gtcatcctgg agctaagcta agtggaaaaa accatatggc 38700 

attgtcgcct tctcagcctg acctggcatg cactctcacc ctcatctgtc atgcctcctg 38760 

cttttccacc tggggggctg agaagtccgg ccatcgaaac cttggttcct gccagccacg 38820 

ggagtttgga agctttatca gattcctgaa gcctcgtttc ctcatgggaa .cagtgcaggt 38880 

gaaagcacct tcctctcgga accgggggga agatgagagg aaattaaata gatgtatggc 38940 

cccgcagcag gactggcgct ctccattgtg tctgaaattg gcaggttctt ggtctcactt 39000 

acttcaagaa tgaaaccacg gaccctcgcg gtgactgtta cagt:tcttaa aggcggcgcg 39060 

tctggagttt gttccttctg atattcagat gcgttccgcg ttttcctcct tctggtgggt 39120 

tccccctctc gctggctcag gagtgaagct gcagaccttc acggcgagtg ticacagctca 39160 

caaacgcagt acagacccaa agagtgagca gcaattagat ctatcacaaa gaacaaaaga 39240 

acaaagcctc cacaccacag aagacaaccc gagggtgtca acattactgg ctccagcagc 39300 

ctgcttttat tcccttatct ggccccaccc acatcttgct gattggtcca ttttgcagag 39360 

agctgattgg tctgttttac agagagctga ttggtccgtt ttgcagggtg ctgattggtg 39420 

cgtttacaat ccctgagcta gacacaaaag ttctccaagt ccccactaga ttagctagac 39480 

acagagcact gattggtgcc tttacaaacc ttgagccaga cacagagtgc tgattggtgt 39540 

atttacagtc ccttagctag acacaaaggt tctccaaatc cccactagac tcaggagccc 39600 

agctggcttc acctagtgga tcccccaacc gggctgcagg tggagctgcc cgccagtccc 39660 

gcgccctgcg ccctcactcc tcagcccttg ggcggtcgat gggacccggc gtgccgcgga 39720 

gcagggggcg gcgcttgtcg gggaggctgg gccgcgcagg agcccacggc cagggggagg 39780 

ctcgggcatg gtgggctgca ggtccccagc cctgccctac ggggaggcag ctgacgcccg 39840 

gcgagaattc gagcgcagtg ctggcactgc tgggggaccc ggcgaaccct ccacagctgc 39900 

tggcccgggt gctaagcccc tcaccgcccg tggcgggcgg cgccgaccgg cttttccgag 39960 

agcgaggcca gctgagcccc cgcccacccg gaaccctcgc tggcccgcaa gcgcagcgcg 40020 

cagccccgtt cccgcccgcg cctctccctg cacacctccc cacaagccga gggatccggc 40080 

tccggcctcg gccagcccag agggatccgg ctccggcctc ggccagccca gagagggggc 40140 

tcccacggtg cagcggcggg ctgaagggcc cctgaagcgc ggccagaggg ggcgccgaga 40200 

gcgagcgggg ctgccagtac gctgtcacct ctcaccacca ctgcccttgc ttgctctacg 40260 

gctgctcaga cccaaggggc aagtttctcc aggtatttcc aaaccaggga gggctttgtg 40320 

tttagtttaa gtcaaggagg ctgtcccctt ttcgaggctg tttggatttc tttcagcaaa 40380 

aactagcaga agttattacg caggaacttt ttgtagcata taaagaatga atcccgatag 40440 

gtactgtgat ggttaccgtt aggtgtcaac ttgattggat tgagagatgc ctagaaagct 40500 

agtgaagcat tgtttctggg tgtgtctgtg ggggtgttgc cagaggagac tgaaatttga 40560 

gtcagtggac tgggagagga agacctctcc tccatgtggg tgggcaccat ccaatccgct 40620 

gccagcgcgc crccaactaa gcaggcagaa gaaggtggga gaagccggcc cgccgagrct 40660 

tccaccttca tcttcctcac atgctggatg cttccttcca ctcctcctgc ccttggacat 40740 

cagactacag gttcttcggc ctttggactc tgggacttac- accagtggtt tgccgaaatc 40800 

tgtgctttct gccacagatt gaaggctgca ctcttggctt ccctactttt gaggcttttg 40860 

gactcgggcc gagccactac tggcttcctt ctrccctagc ctgcagacag cctatcagga 40920 
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aactttgctt 
tagatcctat 
caacrtcccc 
gtggaggaag 
acctacgacc 
tggccctggg 
aataatagga 
caaggcccag 
gggccttggg 
ttataaagat 
ccatcattta 
cattcatgaa 
gtgctgggca 
aggaagtgca 
gttccaccca 
ttctgccagt 
agggctgtcg 
aggctgaaac 
ataggaggtt 
aagaagccgg 
ttgctgccac 
gttaccctat 
atcagaaata 
agccattctt 
tgccctgaat 
ccctttccgg 
ttaaagcccc 
acccctt:t:ca 
gctatttctg 
tctcatatgg 
tatcaccatc 
ggattcactg 
tggagaccga 
tgcaacctcc 
atitacaggca 
actcctgact 
agccactgcg 
tttttgtttg 
ttttctcaat 
gatttgttgt 
ccaccaccca 
ctictcctccg 
ggctgctata 
tctctcaggt 
cttctctctt 
ttgctccagg 
aaaaai:caga 
ctggacatag 
attgataaca 
gcaggactaa 
ggctctaaga 
ctaagatcag 
ccagaccagt 
gaaaacctca 
cttcccaagc 
aactcacgcc 
ggagtttgag 
ttagccagtc 
atcgccgaac 



tgtgatcatg 
aacttctgtc 
tagattctgc 
aacagaggca 
gcataggttt 
caggttccct 
cctgcetcat 
cacacagtag 
cacacctcct 
ctgcttttca 
cacaatcatt 
tttatggatt 
tggtcaagac 
gggactgtca 
ggaacccctt 
ccctgaaccc 
aggcgtgtga 
ctactgggct 
gacacaaaat 
ccaaaatcca 
actcccacca 
atggtctaga 
accataaaaa 
gtattccttt 
tctttcttcc 
caacatctta 
tatccctttg 
gctgtttctt 
attttccagt 
cccctctggc 
tcactgggtc 
ctgggccacg 
gtctcactct 
gcctcccggg 
cacaccacca 
ctggtgatcc 
gccggcttat 
tctagttttc 
gtgttaaaac 
ttggaacttc 
gctgtcaccc 
tggtatccct 
acaaagagac 
caccatctga 
gtcgctctct 
tcctgaaacc 
cctaaccgac 
gccaaactaa 
gccctttccc 
caaattagct 
gtctgaacct 
tgcttgagat 
aatccggctc 
cttcaaccca 
ccctacccac 
tgtaatccca 
accagcctgg 
atggttgtgg 
ccaggaggtg 



tgagtcaact 
actccggaga 
aggctgggtg 
ctggggtcag 
gagagacccg 
aacctctgaa 
gggtctatgt 
gcgtcaatca 
aacttttcca 
gggtggctat 
cattcatgga 
tagtccatca 
acgccccctt 
tcagcaaacc 
ggtttggagg 
ttctcacact 
accagagcaa 
gcattcccag 
acaagccata 
ccaaaaccaa 
ccgccargac 
aaggggaggc 
tgggcaacca 
actttcctaa 
acgagatcca 
gcttcctttc 
ctcccacccc 
ctgctaatta 
ttgagataat 
ctccttcaac 
ctgttaaatc 
gaaggcacta 
gtcacccagg 
ttcaagcgat 
cagagacagg 
acccgccttg 
gattactttt 
tccacattga 
tatcagataa 
ccatcttcct 
tggggactcc 
gtttctggga 
cccccaacac 
gtaggtggcc 
ctcttttgac 
accttracaa 
tccatcttcc 
ctttggaaag 
aaaaagaccc 
acaagattag 
cggcatattg 
atttcgcagg 
aaccagtcct 
ctatgattcc 
caaattg^ct 
gcactttggg 
cccatgcagt 
gtgctgtaat 
gaggctgcat 



ctccctgata 
accccaatat 
gagttcacca 
gacagaatgg 
cgttccagcc 
gctcaagttc 
agaatacgat 
gtggccacac 
gcctcagttg 
gattcgtcga 
gccattcatt 
tccattcatt 
ccaacgggca 
ctgctccaga 
catctccatc 
ctccacccca 
cccca^cttg 
acagttaagg 
cagaccttgc 
gatggccatg 
agttt:acaaa 
atgaataatc 
gcagccctcg 
raaactt:gct 
agagccctct 
tctttaaatt 
cagttgccat 
ccttcatgtt 
atgtgtagac 
gagactttaa 
agtgtccatc 
tgattatgtt 
ctggagtgca 
tctcctacct 
gtttcatcac 
gcctcccaaa 
tgtttatgct 
ccactaartc 
gctctcaatt 
gtgccaatct 
ctcactgtca 
tcccagatgg 
agaagttrat 
tiaggccagtg 
ccacattttc 
aatcat-aact 
Ctctaacctt 
gaattcagtt 
cccttcttgc 
aaatcacagt 
ctcctgggga 
ccctgtgctc 
accattgcac 
acctccaacc 
tcaaaaactg 
aggccaagat 
gaaaccccgt 
cccagctact 
gagccaagat 



aacccccttt 
agttaccaat 
ccctttgctt 
ggccagaccc 
tggcctctgc 
cttacctcta 
agagcgcaaa 
ttgtcactat 
gctcctctgt 
gaaatigaagt 
ccttcaatca 
ctccagttct 
gataggtaag 
gtggagacag 
tcctgccagc 
cccatcagga 
aataggagat 
catctaagtt 
tgataaaaca 
agagtgacct 
tgccatggca 
cgccccttgt 
ggctgctctg 
ttcacttttc 
cttggggtct 
aaattaaatg 
gcctcagagg 
ttggaataac 
accatgaaag 
tggacaagcc 
atttctgtgc 
tctttctttt 
gtagcatgat 
cggccttcca 
gttagccagt 
gtgctgggat 
gtgcctaata 
atttgtgaaa 
tctgcttccc 
ggactgttca 
cccgggtgtt 
at^aactggg 
gcaaaccaga 
tgctggctcc 
tatcttgaga 
gaggaaatta 
taagctgtcc 
catggtttga 
ctagggtcca 
ttagaggtca 
taacatcatt 
gatggattag 
ctaggaagag 
tgaccaatca 
tgatcccggc 
gagcagatca 
ctctactaaa 
cgggaggctg 
cgtgccactg 



cacagacaca 
agaaacctta 
tgaggtatca 
cagctccaga 
ctttactgtg 
gaacaggagc 
gtgtgcgaca 
cctataggat 
gagatcagga 
atctatcaca 
tttattcaat 
caaggctcct 
cgggggcctg 
agcagagcat 
ccagctcagg 
gtcatatggc 
gggtaaaatg 
acagggcgag 
gtttgcagta 
ctggtcgtct 
atgtcaggaa 
ttagcatiatc 
tccatggagt 
tccatggact 
ggattgggac 
ctcatggttt 
aacggcctcc 
atgttcaaat 
aaaaatcgtc 
accttcccag 
tgtgtaaaca 
tttttttttt 
ctcagctcac 
agtagctggg 
ctggtcacaa 
tacaggcgtg 
atitratctagt 
aatagaaaat 
cagccccgta 
ctictctaggc 
ccttttgtgt 
aaacaggcta 
attgtatttc 
aggcaggttt 
Cctaaac^ag 
tgacagtgaa 
ttgttcattc 
ctigaaacaaa 
gtctgccttt 
tgcagcctct 
attgtaaaac 
ccaacaccac 
aagacattaa 
gcactcccca 
cgggcgcagt 
cttgaggtca 
aatacaaaaa 
aagcaggaga 
cactccagcc 



40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42160 
42240 
42300 
42360 
42420 
42480 
42540 
42600 
42660 
42720 
42780 
42840 
42900 
42960 
43020 
43080 
43140 
43200 
43260 
43320 
43380 
43440 
43500 
43560 
43620 
43680 
43740 
43800 
43860 
43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
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tgggcgacag agtgagaccc tgtctccaaa ticaactaatt aacaaagtta taatcgaaat 44520 

taacactctg atccccgaat gcttggggag actgatttga gtaataataa aactccagcc 44580 

tccctcacag ccagctctgc ataaattacc ttctccattg cagttcccct gtcttgataa 4464 0 

atcggctctg tccaggcagt gggcaaggcg aacccactgg gtggttacac tttcagcact 44700 

gtgtctacag tctggccagc ctttaggatg ggagggcagg tgaaagacag gcttctcccc 44760 

tcttacagta gtgcctggag cagatcactt ccattgtcct cgctgttggc cccacctggc 44820 

tctgggggac aatcctgtgt ccagataaca actccaatgc gacgtacgaa gggaaggtca 44680 

gatgtcaggg acaaccagga gtccctgcct catgtgtttc cttctttcct ggtttcgtgt 44940 

tagtggcgca caccctccag caacttcccg agaaattgcc tttatgtgaa cgggagccag 45000 

tctgaaaact tcccaacata ggtccatiacc tttgggccat ctcccctcac ggtgacctgg 45060 

gcttgctcct gtgacttgtt ttggcccatg agacgacagg aaatatgaca caagcagaga 45120 

cttgagaggc aggtgtgcct tggggcttat cttactgaca ctgggggctc cagtgtcaga 45180 

gacattggaa ccagagtgac tccatcttga ataggggctg ggtaaaatga ggctgagacc 45240 

tgctgggctg tattcccagg aggttaggca ttcttagtca acaggagatt cacagatggg 45300 

gaacaagtga acgatgttta ccaaacagac cccggactta acagatccag gagatgtrcct: 45360 

gatgtcccga caccttaaga acaaaagcat tcttagttca aaaataagtt tggtggctgg 45420 

gcgtggtggc tcacacctgt aatcccagca ctttgggagg ctgaggtggg cagatcacct 45480 

gaggtcaggg gttcaagacc agcctagcca acatggtgaa atgcatctct attaaaaata 45540 

caaaaaatta gcaaggtatg gtggctggca cctgtaatcc cagctacttg ggaggctgag 45600 

gcaggagaag tgcttgaacc tgggaggcgg aggttgcagt gagtggagat cgcaccattg 45660 

caccccagca ctccagcctg ggcgacagag caagactccg tctaaaaaaa aaaaaaaaaa 45720 

gttttcgctt taaagattct tgcagaagac agtagttgca caaagatgaa caaticctr^ag 45780 

tcaccagccc tagtagcaga gcacacctcc cccaggattt tgtggct.ttg tcttatatat 45840 

gaacaagcac tgtacctaag gcacatgtgt tcctcatctt gctcttggga acaccg^acg 45900 

ctgcctatgg agtagcaact cccticttgcc tttactgtct taataaactt gctttcactt 45960 

tactctgtgg actcgccccg aactctttet tgtgcaagaa ccaaaaaccc tctcttgagt 46020 

ctggattgag accgctttct ggtaacaact ggggcgatca cagggctggc ctgggagagc 46080 

ccgctgggtg agagagatgc ctgctgatgg catctgtgtt gtcctgagcc aaccagcaaa 46140 

cctgtgagtg aggccgttct aaaacattca tcctcagctc tgcctcccag cccacacacg 4 6200 

gaattgtaag aaatagtcaa tgtttttact cttttttttt taagacacta aattctgggg 46260 

tgtttcar^a cacaccaaaa gctaattggt acaaaacctt gcatgggtaa aaagccttca 46320 

tttccctctc acgtgtaatc aatagtttga agggc^catag ccagcattcg aggtccacga 46380 

atttcttctt agagtttgaa agcagcgctc ccttccctta ttgctgctgg gaagcctgaa 46440 

gacatattct gattctccta agctatcgtt tacagtgttc tcagtctccc tgagttctga 46500 

tcctcaaagg gatggccctg gaagtgggta ggctttccac cattgcgccg ggcactgtag 46560 

tttcttttca tctgcaaagt ttgattcttc aggtcagaaa cattttgaat attttatttc 46620 

ctccattttc tttgttgaac tcctcatatc tgtgtgctga ggcctctttg aggaggttag 46680 

ctttgtttgt ttgtttattt atttatttat ctatttattt tgagatggag tctcactctg 46740 

tcacccaggc tggagtgcag tggcatgatc tctgctcact acaagctctg ccttccgggt 46800 

tcaagctatt ctcccgcctc agcctcccga gtagct.ggga ttacaggcac gtgccaccac 46860 

gcccggctaa tttttgtatt tttagtagag acgggtttca ccatgttggc cagcttggtc 46920 

tcaaactcct gacctcaagt: gatcagcctg ccttggcctc ccaaagtgct gggattaaca 46980 

ggtgtgagcc accacgccca gccgaggtta tctctttttc tcacctattt tctttgggat 47040 

tctctttgat attttttgtt ccacttcctg agaaatttcc tcaaccgtat cttccaatct 47100 

ttgcagagat ctttattttt gcaatcacgc ttctaatttc catggtttcc ttagagtctc 47160 

cacgtgtttc cttggctatg ttttccatat tctcttatct ccatgcagat atccattatc 47220 

cgagttttga aaacggtatt ccacctggca ctgtagctaa aggttgcagc ttcttttcct 47280 

tcctctttgc ttgtttgctt gtttttgtct tggtcttttc cacagcaggg ctgtccttga 47340 

gtgtctggca ttctgggctg tctgcctcta tctcagcagc tgcctggaat ctctgtgtgc 47400 

ctgtggcaga gtggggatgc gtgtcaaact gccagcttcc cagttggatg ctttggtgga 47460 

gcagtacctt tttttttttt ttttttttga gatggagtct cactgtcgcc caggctggag 47520 

tgcagtggtg taatcttggc tcactgcaac ttccacctcc catgttcaag ccattctcct 475 BO 

ccctctcagc ctcccgagta gctatgattc taggtgcatg ccaccatgcc tggataattt 47640 

ttgtgttttt agcaacgaca gggtttcgcc gtgttggcca ggctggtctc aaactcctga 47700 

ccactggtaa tccgcccgcc ttggtctccc aaagtgctgg gattacaggg gtaagccacc 47760 

gtgtctggcc gatggagcag tatctcgagg gacctccaag tgtccaaacc taggtctttt 47820 

ctctccagct ggccaatttc ccagtgaggg ggcctttaac cttccccggg gtgtgggtgg 47 880 

gttgggtgtg acctgttctc agttggctgc ctgccgccag gtcttcctgg ggtcccggtc 47940 

cccgcagcct ctcgcacaat ccctggggaa cgaacctccc gccttctgca ggggtgggag 48000 
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agggatgcct 
ccccggtgcc 
tggttgcccc 
cccatgtgcc 
ttgatccctt 
gcagggttct 
taagtagtcc 
cctggtctca 
aatcgaggcg 
cagggaaagg 
tgaggcccca 
ctctgcggac 
gagggactca 
cacagaatcc 
ccgtatagag 
gaggaagcca 
ccagtgggtg 
ccaagccccg 
agatgagctc 
tcaaggccat 
atccatgtgt 
ccaccactcc 
ctctgccacc 
tccccaggaa 
ctacaggccc 
tgagcaggga 
atatggggtc 
tatttattat 
caaggtgtgg 
tttccagctt 
atccagcacc 
agctgtctcc 
ggccccacat 
tgcaagatcc 
aggctgcagg 
ggtttgaatc 
ctcatctctc 
atgctctgca 
ggtgacctcc 
ggagcggcat 
gacgcacatc 
agcgccaatg 
acctgaagcc 
cagggaaacc 
ttgtggcagg 
ctctcccccg 
atgaggagtt 
tttcctcatc 
ctatgcacac 
agcactgngt 
aagataggtc 
aagagctgca 
aaagcccatg 
ttggactcac 
trtattggaaa 
gagacaccaa 
gacaacttga 
tcgtcctcag 
cgactatgtg 



gccgaggaag 
tccaggccca 
caccgcgggc 
ttcttgcttc 
ctttgactct 
ctgggcccct 
ccaaggactg 
catctgtgtc 
cccagctcag 
cgtgagcagc 
ccaggcagcc 
acgctgcctg 
cccacctctt 
tcttgatcag 
cctggtgtgt 
ggtcacccag 
gaggtcaggt 
gacctagcec 
cggagagatc 
ctctgcctgc 
gcagaccccg 
acaggctcca 
atctggctgc 
agagccctiac 
cgctttgctg 
aaatccctac 
cctggcgctg 
cttccagtct 
gcagggctgg 
ctagaagctg 
gctggtcgag 
aatacaacca 
ggataatcca 
cagagct:ggc 
atccatcacc 
ctgacccctc 
taggcctcct 
gaaaggatag 
agcacagcag 
gccctcccac 
caggaagtgg 
aagcccctgc 
cagaatrccc 
caggcgtgcg 
tggggcgggg 
caggctitcca 
atgtgctacc 
t:gtat:catat 
ctgctgccag 
tttgcactat 
gtcttattag 
ttaggagcta 
ggcaticccca 
taaaaacaaa 
aagctggtat 
agaaaaaaag 
ccaccatgta 
agaagagcua 
aacgcccgta 



acatrcagaa 
gagcctttct 
ttgggctctg 
ccarcttcaa 
cgagcctgtg 
gcaggggacc 
tgcagggcac 
cgtgccagtg 
gaggctttcc 
ggctgggaaa 
agaaggtcca 
gggtcagacc 
cacgcttcag 
gtggctgtgt 
cctctgtgaa 
gcttccccca 
cgggcaggag 
agtgggtgtc 
cagggaggga 
cacagctccg 
ctggagcagg 
gccctcacac 
ccctccgtcc 
ccttgccctg 
cttccctgcg 
ctaaaggggt 
ccatgacaaa 
ggaggtcaga 
ttccttccgg 
cctgtgttcc 
tcttcatgtc 
ggaacggtcc 
ggctaaattc 
aactggccaa 
accctgggcc 
cacttattaa 
gttctccatg 
catgcagcca 
gactgcgggg 
tatgtacaca 
ccaccccctc 
cagcagaccc 
tcccgaaagc 
tgggacactt 
cctctctggc 
ttctctggga 
tgtgtgattg 
tctcatggag 
acaaaacaag 
ccaagt^aac 
ctccatttcc 
gagggagcaa 
cccactgtat: 
cagcattctg 
taacat:attt: 
aatgcaccta 
t.cccacacca 
aattaaaaac 
catt:cgagag 



aacctcaatt 
ggggttctat 
ctctctcagt 
ggccacggtc 
ggagcgaggg 
cagtgttgag 
cagggacacg 
ccagggggcc 
aggagcaaag 
acacacacca 
ggatacagtt 
ctggccccaa 
tttccttatt 
ggacggatga 
tactaggaat 
ccccccgcca 
ggtaggtcac 
aggaacgtca 
aagcattgag 
agcgcagggc 
ccaagctgtc 
gcctggccct 
tctgcagccc 
ctcaagaaag 
tgctgtggac 
ggctttgtca 
tgactgcaaa 
ggtctgacac 
aaggtcccag 
taggctggtg 
atgccttgct 
tacactttca 
caccttcagg 
tgttggctga 
tccgcacatg 
ttctactaac 
tgtacaatgg 
caatigagaca 
ttcagtcgag 
cacagccaga 
cctgtgctct 
gcagaatagc 
cagaggcagg 
acccacacac 
ttcctgaaga 
ccctcaggca 
cgagctaatt 
agtgtgagtig 
acaactatca 
cttgtccttc 
tcagtgatga 
acccttgacg 
tgcctcctga 
gacctcgttt 
atagttttat 
tgagttacag 
ccacccccac 
aaaaccaaaa 
taccaggaaa 



ttctccccac 
gatgcaaacc 
ctctaagctg 
cagctcccgt 
tgtttgatct 
cccacataca 
gtggtgagtg 
tgggctgtgg 
ggtgcacagt 
gaatgtcacc 
tagtaattaa 
cacccacagt: 
tataaaacgc 
gatgatgtgt 
gacatggagg 
agccaatccc 
ccaggggcgc 
ggacccgcag 
catcatttct 
ctggtgtgga 
cctggccagg 
ctagaagccc 
cataaaccac 
gcaatgaggt 
ctcccaaggg 
ccaggacgaa 
ctgagaggct 
aggcctcact 
gaggatctgt 
gccccctcct 
gagccttttt 
cggacttggg 
tccatgacct 
agttgggttt 
cctgccactt 
ttgggtatca 
gtataacaat 
tcccttccct 
tcaccccaca 
gaggttagtg 
gtcctggagc 
acccgccttc 
gcctggctcc 
tctcctgttt 
atccaggtac 
cgtctgtggg 
ctttaacctc 
tgtgtggaga 
ttacggagtg 
ctctcccacc 
attcagagct 
tgaacccagc 
tgggctccac 
ttgatgtaac 
tcaacaattg 
agtccaaact 
caccaccacc 
aaaccccaac 
tgtaagcaga 



cctcagaggc 


48060 


agctgccttg 


48120 


tttccactcc 


48180 


tctctttgtt 


48240 


gcagcgttga 


48300 


cctcgctcag 


48360 


aacgatggtc 


46420 


ggaaaggagg 


48460 


tggtctcagg 


48540 


ttccgaggcc 


48600 


gacctcgggg 


48660 


ttgtggccat 


48720 


gcggaaaaat 


48780 


acacagccct 


48840 


gcaggtcagg 


48900 


tggacctiagc 


48960 


ccccgccaag 


49020 


ggctggggag 


49080 


cccagccctc 


49140 


gtgggggccc 


49200 


ggccagcrcc 


49260 


tgctcactac 


49320 


caccagaaac 


49380 


gccaccaggg 


49440 


cccttgcctg 


49500 


gtgcccaggg 


49560 


taaaagagca 


49620 


gggctigaaac 


49680 


ttccttgcct 


49740 


ccatcttcac 


49800 


ctgttgctac 


49860 


tgattagatg 


49920 


taatcacatc 


49980 


ggggaggctg 


50040 


ccacccctgg 


50100 


gttaaattac 


50160 


agcaaccccc 


50220 


gtaggggccg 


50280 


gt:act:gagga 


50340 


agtggccagt 


50400 


tgtggaactg 


50460 


ctcctgctga 


50520 


aggcagggtg 


50560 


agctcagcca 


50640 


cctgtggttt 


S0700 


ggagcactgg 


50760 


tctgaatcag 


50820 


tcatgatggg 


50880 


tcatgtgcca 


50940 


acaaacctac 


51000 


agggaaagtit 


51060 


ttcccagtgc 


51120 


ctccatgagg 


51180 


atttgttatt 


51240 


ggtaatttgt 


51300 


gatcagggct 


51360 


accaacagct 


51420 


aacttcacaa 


51460 


gccaggatgc 


51540 
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aagtctgtga 
ccttttctta 
atccagagac 
ttggtaccca 
gaaggacacg 
aacctctagc 
tcgtagcaaa 
ctatctatac 
tcctttgatt 
tggattttat 
atgacccatt 
c^ttCacaca 
ctgggttatg 
ctgggtctcn 
tccatgagca 
ctaccatgga 
ggargtgccc 
atgcaatttc 
tgaagtggag 

ggggggtctt 

agtctgccat 
gtcgctctaa 
acgggttcct 
cctcagcttc 
agcccagagc 
acctcacagt 
gaaggtgccc 
actgcctcta 
gaatcagatg 
cgcaccagac 
gcatctgcaa 
ctgagacagg 
actcaccatc 
tcccatcagc 
taaaaagtcc 
agaaccctga 
cctcccttcc 
gaaagctggg 
ccgtcctcat 
tcggggatga 
tgtcagagtt 
tgttctcagc 
cactgatgtc 
cccggggcct 
cgaacatgcg 
aggcatcagg 
tcacgcccat 
gttcaagacc 
gaccgtagtc 
agagactgca 
ctgtcccaaa 
ctggaaaggg 
tcacatgtca 
cctctgcccc 
atcccagcac 
gcctggccaa 
tggtgggcac 
aggaggcaag 
gcaaaactcc 



ccattacact 
acacacattt 
atacttttct 
aagtttaaaa 
taagatctgt 
tgcaaataca 
tacccaagta 
aactgaccac 
gtiraaattta 
agtgatatat 
tccctgtgta 
cacaaaagta 
ttatatctcg 
gccccccacc 
aagacgactc 
aacaccacct 
cccgggactig 
ctttcaaagg 
tcctggggag 
ctctgtcttc 
gtcccacttc 
catgagccgc 
gacaccactg 
ttctcgtgga 
ctggcacgct 
caggaggaag 
agt^aatgct 
gggctgagca 
aggacacagg 
ctgggacctg 
attctcagca 
gcgaattgcc 
tgt.ccctgga 
tccggtccct 
tattgatgtg 
actgtiaagac 
tttttatttt 
gcagggaggg 
cagrgccgtg 
cttgcatgag 
taccttgtcc 
cgggtgccag 
atcacgccgg 
gctgccttcc 
gaagaagttg 
tgaggaggag 
aatcccagga 
agcctggaca 
ccagctattc 
gcgagctgtg 
acaacaaaca 
aaaccacacc 
gctgccacca 
aatcatcatg 
tttgggaagc 
catgatgaaa 
ctgtaatccc 
gttgtagtga 
gcctcgaaaa 



ggtccttcca 
gatcctcaca 
tcaaagaaga 
cagaagcaag 
actttgaaga 
agtttctcct 
atgggtaagg 
attctcgaac 
attagaaCag 
aaacaaacat 
aatgtratat 
tactgtaatc 
taaaaaacaa 
ttctgaatca 
agaggggtgg 
tctctgtggc 
tgcggtcctt 
cagttcagtg 
ggccacactc 
caticctgcgt 
acaagacaga 
ccacgagatg 
cacggcacct 
cagaggggat 
tcgagaagtt 
attccaaaaa 
gaagggtgag 
ccacgcct^gg 
cacacagaag 
ggatgtrt:ca 
tgaattcacc 
ccatttataa 
gccctgatct 
gcaacaggac 
gtraaagaccg 
tccacagtca 
tttaaacaga 
ggcccaaact 
actttcttcc 
gctgggtggc 
tggaagatgt 
gccgtgtgca 
cgcCtgcccc 
ctcgaggcct 
ttgtaggccc 
cacaagcccc 
ccttigggagg 
acatagcgag 
aggaggttga 
atttaccact 
aatgaaaaag 
tgcctctatt 
acatgcagac 
aaaataccct 
caaggcgggc 
ccctgtctct 
agctactcag 
gccaagatcg 
aagaaaagaa 
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ccgtttctgg 
atttgctttg 
tcttttcctt 
gaaccaggaa 
tggtaaatgt 
cttaacattt 
ccctcagagc 
ctctttacaa 
atatgtacat 
tcttcattca 
ggccagaagt 
cactgagaat 
aacaaacaaa 
aaatgccaaa 
gcaggtttgg 
cctcattgtc 
aaagtaaagg 
cattggctca 
ctgaagccac 
ctcagtrtctc 
ggcattcctt 
gactggagag 
gccatgggtg 
cctaagggtt 
ggtgtttatg 
atcgtgcatg 
gcggcaggga 
tacacagcag 
acaaacccca 
agacactcag 
atccaaaaaa 
ccaaaagaac 
gggaccagtg 
gcactggagt 
cagaccgaga 
tgtccatttt 
caattaccgc 
tcctgtgtcc 
cctccttgca 
aggggccggg 
acaggttgtt 
ggatcttctt 
ccacgcacac 
ccagggtcac 
cggtcatgat 
ttctcgagaa 
ccccggtggg 
actccacctc 
ggcaggagga 
gcagtccagc 
caatcccaga 
cccctgtatc 
tttggggcca 
cctggccagg 
agatcacctg 
accaaaaata 
gaggctgagg 
cgccactgca 
aagaaaagaa 



ttttgtccct 
gacatcatta 
tcttttttct 
ctgtattaag 
taaatctatg 
gactcacaaa 
tgggctttgg 
gggggaaaaa 
ataataccgt 
cgttgattaa 
gagtacacac 
aactcagccg 
ccaaaaaaaa 
ctacgtggct 
tatcaccaga 
agaggggcag 
tatcctaaaa 
aggagggtgc 
caagcagagc 
ctggacccrt 
tccaaacctt 
aaccttgtgt 
acctgtgcat 
cccgctt:at:c 
gaggaaatga 
ggagtcagtg 
gggctgcaga 
aggctcagca 
ctccgctcct 
aagacctagg 
caacggtgag 
ccctggctgg 
aggcatgggg 
gaaaaattaa 
caggaaggga 
atgatttgtg 
caaacacaat 
acacactgcc 
ttgcggtcgt 
aactgcacat 
ggtggcggca 
ggtgaagtcc 
gcgccgtggc 
gtcccgctcg 
gacgctggtg 
ataagcccag 
aggattgctt 
tacaaaaaaa 
tcacctgagc 
ctgggtgaca 
gcctacagag 
ttggggcctc 
ggagccaaga 
cgtggtggct 
aggttaggag 
caaaaactag 
caggagagtt 
ctccagcctg 
aagaccctcc 



cccctcgaca 
ttttccaggg 
tctttttggt 
acctcctttt 
atgtgacagt 
gggagatgga 
ctcraacacg 
agtaatccaa 
ccaggtcaac 
aaaaaaacca 
gcactttgtg 
ggtctgtttc 
ctggatttgg 
ctgctgcaag 
acaggagcat 
agttcccgag 
tggt:cagaac 
aagcccagga 
gaggagcccc 
gtggtcgcca 
ccatttgaat 
gaagacccgg 
ccgctcagag 
cccagcatcc 
atgaacaccc 
tccgctccgg 
ggg^aagtca 
aatggcaggt 
ctcgggaaac 
agaggagctt 
gtcatcagct 
ctcgtttgag 
gagaagcagc 
cccggacgca 
cgtgaatgaa 
gcggtgaacg 
tctggcctag 
acccccgcag 
gaaggtcatg 
acctagtgca 
acggcaacga 
aagctgtcca 
ttgagcacag 
gtgttccgat 
ggagaaggag 
gtgtgggggc 
cagcctagga 
aattaaaaat 
ccggggaggc 
gagtaaaact 
ccatcactcc 
tcataccccc 
tcctttttat 
cacacctgta 
tctgagacca 
ctggacatgg 
gcttgaatcc 
ggcaatiaaga 
tgtaatttct 



51600 
51660 
51720 
51780 
51640 
51900 
51960 
52020 
520B0 
52140 
52200 
52260 
52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
S3 160 
53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
54720 
54780 
54840 
54900 
54960 
55020 
55080 
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gctagtccag gagagaaagc tctggccttc caaacagacc gtgcaaccac agcaggaggt 55140 

agacatggaa gcagctggga cagaaaacca ccaggccctg cttccctcca ggggcccgcc 55200 

tgtcccgagg acticgggcga ataaacaaac atgcatcgcc atgagggtcg gggcgagcag 55260 

tgctgacgtg caggggcccg ctgcccccac cgtcccgcca aagagaatcc cacaccactt 55320 

ctctgctctc cctgctccta aggagaaggg gttcctgccg gtgggaggaa gggcaagttc 55380 

atccacatag taagaggcag gagagaaagc acagttctgt gggcctgcgc ccagagggag 55440 

ggttttgggg tttggcctct gcagggccct ggagcagctg tgaggtgcaa ggtaactggg 55500 

gaggaagagg atgttacgaa aagcgaccat aaagaagttg tcctgctgaa ggctcttcag 55560 

ggaggaaatc ctgtttcctt ttggaagctg agctgggctt cctgagggtg acactgctct 55620 

gcgacagaac caggtggatc cctgggtctc caggggttat gcacagagga caaaagctca 55680 

cacgtgagct ccggcactgc agtcaggggc ttgggctctg gccagtccgc ctgggtccgg 55740 

atcccagctc tgccctaccc tgacatgcgc actctcttag ttttctataa gaactgggat 55800 

caacaaggga acgcagcccc tggtgagcgt gaatcagggg acataggctg aacaccccgc 55860 

acagtgctgg gcttgaggca ctttgcttct tgttatttac caggtactcc tcgatcctaa 55920 

tcgtccatgt gaccaaagcc gcacccgctt tatatgtgac tcagagcctg ggacaaggga 55980 

agcagcatgc gaggggggcc gggtgaacga gticccacaga agggccctgc ccaggcgcca 56040 

caggctgact gggggaggcc acaccctccc cgcgccagca cagtggggag gcagccccct 56100 

ttgttcttca gggaacagtt cacagaatga acctgggcga tttcacacaa tgctggttict 56160 

tctgatccat ttagtggaag gagagcgaca taattaaagt catttgcaac ccacacagtt 56220 

cctgatcctc cccctgttcg ggtgtggaat tcagctgaca ctgctctggg ggagagggag 56280 

gggcccctct catcctagag caccaggccg caccaccatg agagccattc ctcacac&ca 56340 

ggcaggtttc ttaaattcca gaaggttccc ttaacaggga aaagggcagg ggagctgctt 56400 

cacctctcct agcctcatct gcatgtggga ccacaacgcc ggctgctttc agccccagtg 56460 

ctccgctcct ctttcctttg gggaaagaaa ggcccctgcc ctcgccaccc gtggcgcctg 56520 

taaticccagc tact.tgggag gctgcggcag gagaagtgtit tgactccagg aggcggaggt 56580 

tgcagcgagc cgagatggcg ccactgcact ccctatccct gtgagcggtc aatcacagcg 56640 

ctctcctcgc cccttggcct ggccaacctg ggcatgtgat atccctgtcc atgctgattg 56700 

g^gcaaaaag tgggcaagtg acccacacag agccaatcag atccctccat gagagtit^ta 56760 

cacagtccct gggaggggtt ggctggctga ggataatgga aaggggaatg atttgggggt 56820 

catctctgaa gagggagagg attgaggcca gtgtgtaaga agaagcaggg aggagcggag 56880 

atggggaagc atggagagaa gaaaacggcc ggagagaaaa aaacagccct agggacacag 56940 

catgggcccc tgggggagcc ttgcccagga ctcctcagag caagtgtagg tgtgtttgac 57000 

tcccgcaccc gagcacgctc ggctggcaca aatctctccc cccaggcagg acagatcaag 57060 

gagaacttca tggaggaggg ggcgttgaag cctgttgtgg gagggcctgc gggattcctc 57120 

cgggtaggta acctggggaa gggcaggcct gccaatagga tagtccacgc aggatcactg 57180 

cggcacccct gcgcatacga actgggatct tgctgaaggc atcacacatg gtctcatccc 57240 

caaagcggcc acgaaggcca gtttcctaga accactittcc agccgagctg acttaggggc 573 00 

ttcctcaagc acacggtggc gagggtagaa tgtggaccag agactgagag agtccagcag 573 60 

gagagctgct gcctctccac agcacgctgc ccatctgcgt tctgccggtg ctaggaagcc 57420 

ccacggcagg ggcca^cggg cagccaccac gcggcagcag cggcgctictg cgggccggac 57460 

caaccgtcag agggggtcac caggaataag agggcagcaa ctgggagcct gccaggattc S7S40 

gagtttgctt tggtctgcaa ggatgtggga acctgggggg aatccttgct tctctcctga 57600 

gcctctgagg ccctggaagg caggacccag ggtctgttta tcttgggacc cccgaccatg 57660 

gggcatgttg ttctcagaac agtgcccaaa gtgaactggg aggtggcaca cctctatgcc 57720 

ttctagaaac acccacagcc actggacaca cctgaggtcc ccagtgaagg gctcagaaca 57780 

ggttgagtca tagtagcccc ccagggcacc ccacccattg cgggaggcaa acaaggacag 57840 

aggt:tccagc cgggcagagc agagccagct gtggagcgag agacagtgag gcgggtgggg 57900 

gaccactagg acagggcgtg gctggcgtgt aaagtgcacg caggaagtgg cgggtgggag 57960 

gcaggacagg agagcccagg cctagacacc actgtcagcc cggggatgct tggcggcttc 58020 

tccagtcc^g ggagcaggca t:cacctggcc gcgggtgccc cctggtggca gcttgaagga 58080 

aggacgggca gtgggecgca gccagcgggg acccaccccg caaaacgcac ataaaagctg 58140 

gaaccagctt gtcacagctg caggtccctc tcgtccgatt tggatagacc ctcttgggac 58200 

ccactgcacc agggaacccc aaatgcagct cagcagcatg ggaggagccc tgtccgc^gg 58260 

gggtgtctgg gatctatctg ggggcagggc ctagagagaa ggtcctgtgt cctgctgacc 58320 

cagctgtgcc agcccagccc cgggccgggc ccccaccctg gtct.tctaag catcctttag 58380 

tggccccctg tgatcaccat gccagcccct tgcctgagct cgggctctgc tgagagaccc 58440 

ctgcgggct-g ggggagctcc aggatgttgg caactctctc agaaacctcc ttccctggac 58500 

acaatgactt ggccccccac ttggatccag ccctgcttcc agggggctcc ctcctggtcc 58560 

tgtctctgcc atgticcagcc ctccccrcat cccacacacc aaccctgacg gtcccagccc 58620 
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tatgcccaga 
gccaccccca 
gtggacagcc 
cttgcccacc 
ggtggtattc 
gacccacttc 
gtaggagaca 
aggcttgtcc 
ttgggagggc 
gacacagagg 
tgtccatgcc 
ccggggcgcc 
aagccctgga 
tgtcaccagg 
gtcaaagcca 
accgggtgct 
gggaccctga 
ctgtgaggtt 
rgagagccgg 
ctgagtagcc 
ccctccctac 
caggcacatt: 
cgaaggtagt 
catcctggcc 
tggcccagcg 
cagccctgct 
cacagcccaa 
cctccaccgt 
cacacggccg 
ctgtgacaaa 
ctctgccctc 
ctccaaactit 
caggcctttt 
agagggaacc 
tcacacagcc 
gagagggtga 
acccggaggg 
gcacgggctc 
tcaactccca 
tgaacgccaa 
gtgaagacat 
ccctccatca 
acctccctga 
cccctgggtc 
ttccttcaaa 
gagccatctg 
agcaggattg 
ctgactcccg 
ttgcctcccc 
gtctgtctgg 
tagcactgtg 
atgaatggca 
ctccttactc 
ctctgccgtg 
aaagactgtc 
caattgtctt 
ggaggcttag 
cgtccccagc 
tcccagccac 



gccacagaga 
gaggtctggg 
ccccaccagg 
acactggaca 
tgacctagca 
tgaatctggg 
ctgagaggcc 
atttcagaat 
agaagaggga 
taaggaccca 
cgccctgggc 
actctccagg 

gggtgcagga 

aagagaggga 
caggggtgca 
gggctatagc 
attaactaaa 
cagtgacaga 
acctgctctg 
ccatgctgcg 
ggtgaggtga 
caaacttgtc 
catggacctg 
cttccacaag 
cagacctgct 
catctcatcg 
tacagccctg 
gacacagccc 
acctctgagc 
cagaacgctc 
ccagctgtgt 
aaagcgataa 
gagggtaatg 
cacagagg^t 
tctaagaggc 
caccccagga 
ccgcatgagt 
tgccctctgg 
ggagccgaaa 
tttcgtgtgt 
tttgtagggg 
tgtggggggc 
ggaccggggg 
accagcctgc 
tctcccttct 
actaatatgg 
ggaggaaggg 
cctggggccc 
cttgggcccc 
gacccccgag 
ttgggcacag 
gcaggtttta 
agcccccagc 
cactaagcat 
aatctaaggt 
ccaataaaac 
atgaggcaaa 
cccttcaccc 
tatggtggct 



gtcatagtgc 
agagcccaga 
tgatgcccaa 
ggactgcggg 
gccagtgcag 
cggctggaca 
acctctcagc 
cttacgcctg 
ttctcccacc 
cgggtagagg 
tgagcccagg 
caggtggggt 
gctccaaggc 
gacagtagtg 
gggctgtctc 
cctcgccccc 
catgtticcca 
agatgaagaa 
gaaagcgcgg 
ggtcaccgga 
ggtgcgggcc 
gaaaat:gcag 
gtgggataag 
aagggtctca 
catctcagcg 
ggaggcccat 
ctcatcttat 
aatacagccc 
tgtggccaga 
acagcacagg 
gatcttggac 
gagggcgaac 
atctccgaga 
tctcatccat 
tgaccaggac 
cctgcatcca 
gccagccctc 
gtgcggaagg 
ttaacaactc 
cagcttggct 
aggttaacat 
cttgtccaat 
attccgctaa 
ccgcctgctc 
ttctcagtct 
tggcaaggac 
cctgctattc 
tgctcatccc 
tgggctgcag 
ggtgaggccg 
ggtactggct 
gtgttcgtgt 
agccctggtg 
ctcctgccct 
ccgctgcttg 
gggatgggga 
gcatactatg 
ttgccttctt 
gccagggacc 



ggcagctctc 
cctgtcatgt 
gtcagggctg 
gaggaggcga 
tctttatctc 
aagaggcatc 
ttctgagatc 
gcaggaccct 
tccagaatgg 
tcaggtcagg 
ctagggagac 
tgagagacgg 
gcccctgagc 
gtgatgggag 
aggacatiaca 
gacaagctigg 
ctactgccca 
tttcatgtcc 
gcttcaccca 
atcccagccc 
gaggtcaggg 
tcgttctcgt 
ggatgaggtg 
aagagcagcg 
agggtctgaa 
gaccaggcac 
cggggggccc 
tgctcatctt 
gacgggagta 
ggccaggacc 
aagtgatttc 
ctcacagtgg 
acacagggtg 
ccacagcacg 
ccttccccag 
cctgactgca 
tgctgaggct 
gaaggcaact 
caacgaacac 
gggccaaggc 
ttaagctgat 
cagtcgaagg 
cagatgcgtg 
tgcagattta 
ctatccccat 
tacacaggaa 
actgcacaca 
tccccagccc 
ccccaccaag 
aggttcttcc 
cagtatgtgt 
cccacccccg 
atgcacgcgt 
ctagtggcag 
ctggctctgt 
gaataccggc 
gacttacctc 
caaccacaga 
cattctagac 



acatgaccac 
tgactgatgc 
ctggctctca 
cagtacatgg 
cacaccgtgt 
ctgtccttga 
gaaacccaca 
gagcacaaag 
aggaagaatc 
agcccctgcc 
acaggaactc 
ctggacttgt 
cccagcgggc 
gaggggtgag 

ggtgtggatg 

gagaggggag 
tiagggcaggc 
tgccccttga 
gacacctgga 
agacccctgc 
cctacctgtc 
acagggaaca 
agtggacggg 
agggtctgca 
tgctctgaca 
acggctgacc 
acgaccaggc 
atcgggggcc 
gcaccaggga 
acccggggtc 
atctctctga 
ttctggcatt 
gcggccatca 
aggcactgag 
gattcccgta 
gccggtccta 
ggagctgcca 
gggcgccatg 
agaaaatgtg 
accagatact 
atactgagta 
ccctgataga 
tgggctcgca 
gaacatgcca 
cctgttggtt 
aacagagtgg 
ggatggttag 
ccgtgtcctc 
ctgagggtca 
ttctctctgg 
aacaactaaa 
gccctgcatt 
ccaaatgcgc 
ggtgaggaag 
cacagatttg 
cccacctccc 
agtgtggctc 
ggctgttaaa 
aatgaggtgt 



cacagtgcca 


5B680 


catccacagg 


58740 


ctccaccccc 


SBBOO 


gctrccfcgaca 


5B860 


atca^c^gga 


5B920 


ggaagcagct 




gatggagagt 




ctga^g^gat 




ccagagaaga 


59160 


acagccccct 




acticctggca 




ggat>t:gcgac 




ccgagtgggg 


CQA AA 


tagaccgatg 


59460 


gat:gat:ggtg 


59520 


cacatcaccg 


59580 


gccaacgtigg 


59640 


atctgagcgc 


59700 


cccattgagg 


59760 


agggcagaaa 


59820 


gctcccgttc 


59880 


gagctt:gctc 


59940 


gcgtcccgac 


60000 


tgccctgaca 


60060 


cagcccgaca 


60120 


ccgaccgt:ga 


60160 


acacggc tga 


60240 


cacgaccagg 


60300 


gcagagtcca 


60360 


cagat.cct:gg 


60420 


gcctcagttt 


604B0 


aagtgcaggg 


60540 


agaaagggga 


60600 


gctcacaggg 


60660 


ccccacagga 


60720 


caagccctgt 


607 8 0 


ggacaacagg 


60840 


gagcc^ggtt. 


60900 


aagggtggtg 


60960 


c ggc gti ti t: c t 


CI rt'5 ft 


aagcaggctg 


610BO 


acaaagacag 


6114 0 


tagtcacccc 


61200 


cgtgagctaa 


61260 


Cugc^ ucccu 




cactagggga 


61380 


acaaggctac 


61440 


ctggtgaccc 


61500 


ctgtgttcct 


61560 


g^ccaggg^c 


61620 


atcaaggggg 


61680 


tttgtgtgtc 


61740 


atgctgttct 


61800 


agctctggag 


61860 


ccatt:t:aact 


61920 


aggacagctg 


61980 


atiticaacaac 


62040 


ccattaatct 


62100 


aagaagtctt 


62160 
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ctgagaagca 

gtgagatcca 

ggcagccact 

atggtggact 

accttcctgg 

taagcaccgc 

ccaccaaaaa 

caataggtca 

tcattcccct 

ccgaaaatgc 

ctcagggatg 

cctgagctgc 

ctgtccctct 

ctgcctgggg 

actcaagtaa 

gtggcttcca 

ctctgagctc 

gtcttcgcac 

gcccagccga 

ccaaatagga 

agcacgtggc 

ggggtgtttt 

caaatgccac 

aagccatccc 

tctaccccct 

gaagggccca 

gaaacatgtg 

cgcttaaccc 

ccatcgtcta 

ttcctccctg 

aagctccctg 

agttgctgga 

gcctggctag 

gggggtctcc 

ggtcccagac 

acttcacgtc 

caggctcttc 

catggggttg 

gtcattcatt 

attctgtgct 

gcagctggct 

tttgaagcca 

atgccagatt 

actgggcact 

ctctgttcat 

cattgccatg 

tgtatgccat 

gcttatctgg 

agagtgggca 

cttcctgagc 

atctgtctca 

ttgggccagg 

agagaacagg 

atgacaacct 

ctgctctggt 

ccgagaggcc 

gtctgtccca 

caaggtggac 

ctctgcaggg 



ttctggggaa 
gtgcatttcc 
ttgccaccat 
ggatcacaga 
actgccctct 
acaccagccg 
atgttggttg 
gctgggccag 
ccttcgtggg 
tcatagaaca 
gaggtaagtg 
ctccagaggg 
cctagtctgg 
tgtgcagtgt 
acacacg^gg 
ggagcgtgag 
tgcaccagtt 
ggaggaaagg 
gagacacctc 
tgatggctcc 
accccatgtg 
cccacccatg 
ctcctccaag 
acatttcttt 
aacatatctg 
ggcttcacct 
ctcagttcac 
tctttaatgg 
catgctccac 
gaattgaggc 
tgaggcaaga 
tttcttcagc 
gaaggccccc 
tgc^g^ggtg 
cttgactgta 
ggacacggag 
aaagaccgtg 
gcacgacgga 
cattcctcaa 
gtgcagaaat 
aaaacccatg 
aagctatact 
gcaaaggtgt 
gcttgtggct 
ggtgtcctgg 
gggtttgtaa 
ccacrgacca 
tccctaaagg 
atgaggactc 
tgcagaagtt: 
ctggtagcag 
cactaactcc 
gtggtggagc 
gtccagctgt 
ctctacccca 
cctccctgct 
gccccttatc 
actgtacctg 
cccactggat 



gatcttiacaa 

tatttcctgc 

gaggtaccac 

aggaaccact 

ccagactccc 

aacgcagtcc 

ttgtgctgct 

atgccaagtg 

gtccttcatg 

cattccacgt 

accggacaag 

agccagagag 

ggtctctgcc 

gaggaggggg 

ctgacaaaca 

ctgttcctct 

cccagcgagg 

aaagcgtagg 

cctcatctcc 

ttctcaccgg 

tgaggagtga 

tttgcacggc 

aagcctccct: 

caagtccaca 

aatggatgat 

agactccagg 

ctctgcgtgc 

tacagctccg 

tggggcccag 

ccctgggccc 

ctgagcactc 

ctccgtgtct 

tccatgggga 

cccacctggt 

aggtagcccc 

gagatgattt 

gagacagaga 

tgactctgtt 

acctagaact 

agccattact 

cctggggagt 

tgatgtgctg 

cctgacattc 

ctggcactaa 

gcaggcatgg 

caaaatcaaa 

tctccttttt 

caacggtttg 

cccagacagt 

cccaaagtgg 

tgaccctggg 

ctaggacctg 

rgacagccac 

ccccatggaa 

gaaacaggga 

ggcccctctc 

ccctcatggg 

gccactgcct 

gccctgatgg 



rcacagtgaa 

ctggaatgta 

atgtgaaggt 

gtggcgctgc 

tgttgtggga 

tgctacacac 

tttatgtagt 

cccagcctac 

agaattccag 

agtttcaggg 

gggccggagg 

ttgctcctgc 

attcccacag 

cctgagggag 

ggcggtcccc 

ggggcccagg 

cctgcgccac 

gctctgagcc 

ctacc&ctca 

gacagtaaag 

cacggtgagg 

cagatcttac 

gat:ctcccca 

ttttattcta 

aatagcaaga 

cttataggcc 

actgcttacg 

atgcaggctc 

cctgacgctc 

tggactgcag 

ctgccctctg 

tcacctacat 

tctcagcata 

aggtctctat 

gggtgagcat 

ccgagaagaa 

agcaatggcc 

ttgcacctgg 

aaacacttac 

atccccattc 

cagcgccgga 

ccctctaccg 

tcaaatgcgc 

ctggccctgg 

ctgagcccta 

tctgactgcc 

tcttggagca 

ctgcccacct 

gtcccctccc 

gacaaggtct 

gctgaagtag 

ggccacagag 

tcaccagcca 

accggctgag 

agcagagctg 

agccacaagg 

aaatggcagc 

ctccagaaat 

actcattcat 



acacgcaaca 

ggcaaagtgc 

ggaaaaccaa 

cggggtcatg 

gaaaaat.agc 

aatttctagc 

tattgctatc 

ggcctcacgc 

aaagctgtgt 

gttca^gggc 

gggttggaaa 

aggcttgagc 

ggacagagga 

gagctggggc 

atagcggtcc 

tcagatcaga 

ccaggacgct 

tgggctctga 

gtgcccgagt 

aatggaggtg 

aaccggctgg 

ccatccttca 

cctaactgga 

acaccacatg 

taataaataa 

tgtgctgagg 

gatgcaccct 

ccagggagtg 

atctctgttc 

gccttgctca 

ccaagctacg 

actgggcata 

gggaaaaaag 

gggtcttgcc 

gtagcggccg 

tgagcggtta 

gtcactgcgc 

gcccatgccc 

cagggaacaa 

atagccaaga 

agaaggtggt 

ccccctactg 

tggacaatgt 

gaccttgggc 

cttataggtg 

acagagttgg 

cctgctctgc 

ggacaattat 

cactgccccc 

ctaagctcat 

cgtggggact 

agacccaggg 

atctggaaat 

tggcagcctt 

atcacagcac 

gctcgtgtcc 

accccattcr 

acaactcctg 

tcattcattc 



gaccccaggg 
ttggaggcgt 
cgcccaaatg 
gagccccccc 
cccgattcgt 
acatcacaag 
attttagcat 
cccagaacag 
ccccctcccc 
ctttgctgac 
tgttcatttc 
cgggc^tatc 
gggctgagtc 
agtcgcaaga 
ttgtgggtgt 
gcagcccacc 
ttcctactag 
a^ctgtcacc 
tccctttcca 
acagatgtrca 
ggtttctggt 
aggcccagca 
accagctttg 
agttttgttg 
tactgtccaa 
atctggaaag 
gggagggcaa 

gggagggaag 
ccacatctct 
gggacatcca 
tgtggggatg 
tgaaagccac 
acccgggatt 
tccatgttca 
ctgtggctga 
ctggggtcct 
tgctcccgcc 
tgtttattca 
gcctggtaag 
aaatggagga 
agagtcggga 
gtgatigctga 
cctctaccag 
cagtagttat 
ggttttgttt 
tacagtcCcc 
acaggaagtt 
agactgggac 
ataacctggc 
ttctgctggt 
aggacccctg 
aacgatggga 
acctaaacct 
gcctgctggc 
cctgaggctc 
tgcctcccag 
ccctcctgtc 
tccagctccc 
atttattcac 



62220 

62280 

62340 

62400 

62460 

62520 

62580 

62640 

62700 

62760 

62820 

62880 

62940 

63000 

63060 

63120 

63180 

63240 

63300 

63360 

63420 

63480 

63540 

63600 

63660 

63720 

637 80 

63840 

63900 

63960 

64020 

64080 

64140 

64200 

64260 

64320 

64380 

64440 

64S0O 

64560 

64620 

64680 

64740 

64800 

64860 

64920 

649B0 

65040 

65100 

65160 

65220 

65280 

65340 

65400 

65460 

65520 

65580 

65640 

65700 
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tcagtcaacc 
agcataaagc 
caagtggcgc 
gataggtgcs 
gtctgagtga 
agggacggca 
aaggagtgag 
cggagagggg 
ggaggatgaa 
ccgacggagg 
catcctgatc 
aagctgctta 
caaatcagag 
aaacacctgc 
ctctccacag 
catggcactg 
gatcaggaag 
ttagaatgct 
ccttgccagg 
ctcaatctcc 
ccacaagggc 
gaggccgct:c 
ccccgtgtgg 
aggagccaag 
ccaccctggg 
cacgtgctgc 



aatgcaaaca 
cactgttcct 
cctccaccgc 
ctggtagttt 
agttggggag 
cacgcaaggg 
cagggggaac 
tgaggaagtc 
atctgagtca 
aaggaagttc 
ccaggccact 
cagggcagga 
caggcagcag 
ctctgaagca 
cctctggtga 
caccgggtat 
gagggaggga 
atcaggctgg 
agcccaaccc 
ctccatcgtt 
acatrgaagg 
gcagagggag 
ccttgaacgc 
ggcgctcttc 
tacaggcagg 
ag 



gcacctaccc 
gtgcagccaa 
cagggagcag 
aggtgaacag 
agattggtgg 
ccctgaggca 
agggacgcaa 
accattaggg 
aggaggtagg 
cagcacccgg 
gcacgcaggg 
aaaggctccc 
gaggaaggtc 
gacccctgaa 
gcagtggcag 
cctgagggca 
gcctagttta 
caggtgttta 
acccgtccac 
cacaggggcg 
gaggaccaga 
ctgggagaga 
cttcacctcc 
aggaaggggc 
gccgcaggca 



tgtgccaggc 
cattctcctt 
caagtgctgg 
ggcctccgtg 
cgatgcctgg 
gggacacgcc 
ggggccaccc 
aaacgggagc 

gaggggcagg 

tgggccaggg 
aggtgagggg 
acgtttatgt 
ccccccacct 
gctcaggaga 
ccttcaaccc 
ggtggtcgct 
tccacraata 
acgaaactca 
acgcccccag 
cccttctcag 
aagaaaatca 
acaggctgcc 
ccgagcgctg 
aggaggaggg 
gcaaagatgc 



cctgggatga 
gcagatacag 
ggaggaagag 
ggaggtgctg 
agaggggtgt 
aagggccgtg 
acgcaggatc 
cactgagggt 
caggaggctg 
ctgcagagcc 
aggtgagagg 
tcatgcctct 
ctcttgagag 
tctatatccc 
cacacctctg 
ccttccagtc 
agtggtgtca 
aaactgacag 
tccccaggca 
tggcccgagc 
gcaccatggg 
agggtctcaa 
gccttgctcc 
ggttgcagcc 
tgracagagt 



tgcagtgacc 
acaataaaac 
ccagcaggca 
tttgaccaag 
gctgggcaga 
caggcagaga 
tcatgggcca 
tcagagacag 
gagrcccctic 
caggtgtcac 
agggggaggg 
ggtatccccc 
caaggcctcc 
tgtccttagc 
ccctcaaagc 
ctggcg^caa 
tattttggaa 
taggcgcctc 
tgcgtgcgac 
tttcctcttt 
aaggaagggt 
tccccgagcc 
^ccctiacaac 
aggcctggca 
gagcccagtg 



65760 
65820 
65880 
6594 0 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
66720 
66760 
66840 
66900 
66960 
67020 
67080 
67140 
67200 
67212 



<210> 2 

<211> 3688 

<212> DNA 

<213> Homo sapiens 



<400> 2 

gtgcagaagg ccgcgctagc cggctcttca gcagcgagtg cagattgctc ccccgcggcc 60 

gcagatctcc cgcttgcgcc gcgttcagct gctcccgaac aacttttctg ccggcccaga 120 

ggccccaggg cgtcgcagcg ccgcgtgcgg cccactcacg ggccggcagg atggactcca 180 

acactgctcc gctgggcccc tcctgcccac agcccccgcc agcaccgcag ccccaggcgc 24 0 

gttcccgact caatgccaca gcctcgttgg agcaggagag gagcgaaagg ccccgagcac 300 

ccggacccca ggctggccct ggccccggtg ttagagacgc agcggccccc gctgaacccc 360 

aggcccagca taccaggagc cgggaaagag cagacggcac cgggcctaca aagggagaca 420 

tggaaatccc ctttgaagaa gtccCggaga gggccaaggc cggggacccc aaggcacaga 480 

ctgaggtggg gaagcactac ctgcagttgg ccggcgacac ggatgaagaa ctcaacagct 540 

gcaccgctgt ggactggctg gtccccgccg cgaagcaggg ccgtcgcgag gctgtgaagc 600 

tgcttcgccg gcgcttggcg gacagaagag gcatcacgtc cgagaacgaa cgggaggtga 660 

ggcagctctc ctccgagacc gacctggaga gggccgtgcg caaggcagcc ctggtcatgt 720 

actggaagct caaccccaag aagaagaagc aggtggccgt ggcggagctg ctggagaatg 780 

tcggccaggt caacgagcac gatggagggg cgcagccagg ccccgtgccc aagtccctgc 840 

agaagcagag gcggatgctg gagcgcctgg tcagcagcga gtccaagaac tacatcgcgc 900 

tggatgactt tgtggagatc actaagaagt acgccaaggg cgtcatcccc agcagcctgt 960 

tcctgcagga cgacgaagat gatgacgagc tggcggggaa gagccctgag gacctgccac 1020 

tgcgtctgaa ggtggtcaag taccccctgc acgccatcat ggagatcaag gagtacctga 1080 

ttgacatggc ctccagggca ggcatgcact ggctgtccac catcatcccc acgcaccaca 114 0 

tcaacgcgct catcttcttc ttcatcatca gcaacctcac catcgacttc ttcgccttct 1200 

tcatcccgct ggtcatcttc tacctgtcct tcatctccat ggtgacctgc accctcaagg 1260 

tgttccagga cagcaaggcc tgggagaact tccgcaccct caccgacctg ctgctgcgct 1320 

tcgagcccaa cctggatgtg gagcaggccg aggttaactt cggctggaac cacctggagc 1380 

cctatgccca ttccccgctc tctgtcttct tcgtcatctt ctccttcccc atcgccagca 1440 

aggactgcat cccctgctcg gagctggctg tcatcaccgg cttctttacc gtgaccagct 1500 
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acctgagccc 
ccgccggccc 
ttggccagac 
cgtgcctgct 
tcaagggcac 
ccgtggtcat 
tcctcttcct 
agttcgcccg 
gtagtgtgcc 
tgaagtcccc 
tgctgttctg 
tgacctggca 
cgcgcaccca 
tcaagtacgt 
cgttcttcat 
ctggcaacac 
cctgccacat 
gcagcggcgc 
tgcgggccag 
agttcagcac 
ccatcagctg 
agcacgactg 
tcccattcct 
catgaggccc 
ccacgtgtgc 
gacaaaggga 
gcatgccatc 
taggaggttc 
ttcaagcacc 
gcatctgtgt 
agctcttccc 
ccggccccat 
accacaggct 
tgcccgaacc 
tccctgccag 
tcctgagtgg 
aaaaaaaaga 



gagcacccat 
gctatcgctg 
cttcatcacc 
ctatgcctac 
ctactgctac 
cctgctggag 
ctctgccctc 
gtggttcacg 
cctgctgttg 
gacgcggagc 
ctggtcctat 
gcag^atggt: 
gatcctctgc 
ccgcgcgact 
cggcgactgg 
ctccacggcc 
caagaagtitc 
tgacggctcg 
cagcgagttc 
caccctggag 
cctcaactgc 
gcgcagcacc 
gtcggcggcc 
ttccccagtg 
agactgtggc 
aggctgctgt 
tccaccctga 
cggtgtctgg 
ctgttccctc 
aacctccaca 
ctttcctggg 
cagaggctgg 
gcctcatgac 
catcaggact 
tgtttagaag 
ctigaaataaa 
aaaaagaaaa 



gcagagccct 
ctgccctcca 
g^gcctgtcg 
ctgctctatc 
cttgtgccct 
tccaccggcc 
cccaccctgg 
cctctggagc 
cactggtgga 
tccatggtca 
gtgtaccgct 
gcgctgtgcg 
agccacctgg 
gacatcgaca 
atgcgctgcc 
gaggaggagc 
gaccgctaca 
cgcagccgcg 
aaaagcgtgc 
ggccgcccgg 
atggcccagc 
gtgcatggcg 
tgaggatggt 
tggccccagc 
tgcagagacc 
gtagctctgt 
gcctgacctt 
aaaagcactt 
tttctttctt 
gt:agcat:tt:c 
ggatgtgcac 
gggaggcggc 
cctcctctec 
tgggaaacag 
agcctgactg 
agcctctggt 
agaaacaa 



acacgcgcag 
tgcccttgaa 
gccacctggc 
tcttcttccg 
acctggtgtg 
tggggctgcc 
tggccggcct 
tcaccaagat 
ccaaggccag 
agctcatcct: 
cagagggcat 
ggccacgcgc 
agggccacag 
acagcgccga 
tctacggcga 
tctgtcgcct 
agtttgagat 
aggaggacga 
tgctcagccc 
gcagcaagtg 
tctcgcccac 
ccgt:gaagtt 
ccgccacgag 
ccgacaggca 
ttgcgaccat 
ccactctgaa 
tctgagtgac 
tacagatgag 
ttgtgttgga 
ttatttgttt 
ggcagcttga 
acattggcag 
agcaggtagt 
agaaccctgt 
tgttcagtgc 
ggaacctgca 



ggccctggcc 
ttggccctac 
cgtccccaat 
catggcacag 
cttcatgtgg 
ccgcgcctcc 
ggccccggtg 
cgcagccacc 
cttctctgtg 
ggtgtggctc 
gaaggtctac 
ctggaaggag 
ggtcacgtgg 
gtctgccatc 
ggcctaccct 
taagctgetg 
taccgtgggc 
cgtcaccaag 
gcgccagggc 
gcctgtcttc 
caggcggcac 
cgccttcgac 
gagcttccag 
tgcaccagtg 
gtgtagattg 
taccaagtgt 
atgggtgtgc 
attccctctc 
tttgtttaaa 
ggtcactigct 
gcctgtcacg 
tgtgtcacac 
gggtgaatgt 

gggggtggct 

cttggagcag 
aaaaaaaaaa 



accgaggtca 
ctgaaggtcc 
gtcagcgccc 
ctgaggaatt 
tgtgagctct 
atcggctact 
ggcgtgctgc 
gtggcggtct 
gtggggacgg 
acggccaccg 
aactccacac 
accaacatgg 
accggccgct 
aacatgctcc 
gcctgcagcc 
gccaagcacc 
atgccattca 
gacatcgcgc 
agcctcatcg 
gagctcaagg 
gtgaagatcg 
ttctttttct 
tgcatgttgc 
ccgtctgtgc 
catggacccc 
gttgggaatt 
caggctagac 
cticccccacc 
aaccaaataa 
acaccttagc 
tggtcaaggc 
tgagctgggc 
gtgaaggtct 
gtgggggagg 
aaagccaggg 
aaaaaaaaaa 



1560 
1620 
1680 
1740 
1800 
1860 
1930 
1960 
2040 
2100 
2160 
2220 
22B0 
2340 
2400 
2460 
2520 
2560 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3688 



<210> 3 

<211> 3512 

<212> DNA 

<213> Mus musculus 



<400> 3 

cggtttcgga gcaacttcgc ggcgggctgg gaggcccagc actgcggccg gcaagatgaa 60 

ctcaggcacc ccacctccga gcccctctgg cccacctcct ccacccgcac cacagcccca 120 

ggcccgggcc cggctcaatg ccaccgcctc actggagcag gacaagattg aaccgcctcg 180 

tgctcccaga cctcaggctg accccagtgc tggacgaagt gctggggaag cagccgcccc 240 

ggagcctcgg gcccctcaaa ccggcagccg ggaagaaacg gacagagctg gtcccatgaa 300 

ggcagatgtg gagatcccct ttgaagaagt cctggagaaa gccaaggctg gagaccccaa 360 

agcacagact gaggtgggca aacactacct acgccttgcc aacgatgcag atgaagaact 420 

caacagctgc tcagccgtag cctggctaat cctggcagcc aagcagggca ggcgggaggc 480 

cgtgaagctg ctgaggcggt gcctagctga ccggaaaggc atcacttctg agaacgaggc 540 

tgaggtgaag cagctatcct ctgagaccga cctggaaagg gctgtgcgca aggctgcccc 600 

ggtcatgtac tggaaactca accccaagaa gaagaagcag gcggctgtgt ccgagctgct 660 

ggagaatgtt ggacaggtca acgagcagga tggaggggtg cagccaggcc cagtccccaa 720 

gtccctgcag aagcagaggc gcatgctgga gcgcctcgtc agcagtgaat ccaagaacta 780 

cattgctctg gacgattttg tggagctcac caagaagtac gccaagggca tcattcccac 840 

caacctgttc ctgcaggatg aggatgaaga tgaggacgag ctggcaggga agagccccga 900 
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ggacctgcca ctacgccaga aggtggtgaa gtacccttta cacgccatca tggagatcaa 960 

agagtacccg attgacgCag cctccaaggc cggcatgcac tggctctcca ccattgtacc 1020 

cacccatcac atcaacgccc tcatcttctt cttcatcatc agcaacctaa ccatcgactt 1080 

cttcgccttc ttcatccccc tggtggtctt ctatctgtcc tttgtgtcca tggtcatctg 1140 

cacgctcaag gtgttccagg acagcaaggc ccgggagaac ttccgtactc tcaccgacct 1200 

gctgctgcgc ttcgagccca acctagacgt ggagcaggcc gaagtgaact tcggctggaa 1260 

ccacctggag ccctacatcc acttcctact gtcagtcgtc tttgtcatct tctccttccc 1320 

gctggccagc aaggactgca tcccctgctc ggagctggcc gtcatcccca ccttcttcac 1380 

ggrgaccagc tacatgagcc tgagcagctc tgctgagccc tataccaggc gt:gccc&ggt 1440 

caccgaggtg gctgccggct tgctgtccct tctgcccacc gtgcctgtgg actggcgctt ISOO 

cctgaaagta ctcggccaga cttcccrcac tgtgcccgtt ggccacttca tcatcctcaa 1560 

cgtcagcctc ccctgcctgc tctatgtcta tctcttttac ctcttcttcc gcatggccca 1620 

gctgaggaac ttcaagggca cttattgcta cctggtgccc tacctggtgt gcttcatgcg 1680 

gtgtgaactg tccgtggtca tcctgctcca gtctaccggc ctgggcttgg tccgggcctc 1740 

catcggctac ttcctcttcc tctttgccct ccccatcctg gtggctggcc tcgccttgat 1800 

gggcacggtg cagtttgccc gatggttcct grcgctggac ctcaccaaga tcatggtcac 1860 

cacggtgatc tgcggcgtac ccctgctttt ccgttggtgg accaaggcca acttctcagt 1920 

gatggggacg gtcaagtccc tgacgaagag ctccatggtg aagctcattc tggtgtggct 1980 

aacggccatc ctgctcttct gctggttcta cgtgtaccgc tcagaaggca tgaaggtcta 2040 

caactccaca ctcacctggc agcaatatgg cttcctatgt gggccccggg cctggaagga 2100 

aactaacatg gcccggaccc agatcctgtg cagccacctg gagggccaca gggccacgtg 2160 

gacaggccgc ttcaagtatg tccgagtgac cgagatcgac aacagtgctg agtcggccat 2220 

caacatgctc ccgttcttcc tgggcgattg gatgcgcrgc ctgtatggcg aggcctaccc 2280 

atcttgtagc tctggtaaca cgtccacggc agaggaggag ctctgccgtc tcaagcagct 2340 

ggccaagcac ccctgccaca tcaagaagtt tgaccgctac aaatttgaga tcacagtggg 2400 

catgcccttt ggcaccaacg gcaaccgcgg ccatgaagag gacgacarca ccaaggacat 2460 

tgttctccgt gccagcagcg agttcaagga cgtgctgctg aacctgcgcc aggggagcct 2520 

catagagttc agcaccatcc tcgagggccg cctgggtagc aagtggcccg tcttcgagct 2580 

caaggccatc agctgcctca actgcatgac gcagctgtca cctgcccgga ggcacgt:gaa 2640 

gatcgaacag gactggcgta gcacagtgca cggtgccctc aagtttgcct tcgacttctt 2700 

cttcttccca ttcctgtctg ccgcctgagg agcgtccgcc gctggaggag gctttggtgc 2760 

atgttgctgt gaagtccttc cgtgtggcca cccagccagc tggagcagca ctgtgccgtg 2820 

tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tgtatgtgta ttctgctctc gtgtggttag 2B80 

atcccaggcc ctctgctcac ctgtagatcg cagatcctgc tggagggtgg ttctctttag 2940 

cactgtccac tttgaatgcc gagtgtcata agaaattgca tgctatcttc actcacaatc 3000 

ctgcccttcc ctcacagagc tiggaacccca agcctggccc caaagacccr ccatcacgaa 3060 

gagcacttta cagataagaa tcttcctccc cagttcttca tgcctccttc ctgccctttc 3120 

cttactttgt gttggatttg ttttccaaat atgtgtgtat gtagacttca tggtagtgtt 3180 

tcttatttat ctggttgctg ctaagccttg acagtggttc accttcctgg gctgtcccca 3240 

gtggtcacgt cctgcctggc ttcctacttg ggtatagcat gtccaaactg ggctctgaac 3300 

actacagcct gccttggagc tggcctatct ctgggggtgt tagagagttt gtggggatct 3360 

cttctgagta ccccaggact tgggaaataa tgcaagagta tccctttcag tacataggta 3420 

agcttggcta tgttcaatac ggcagagcca aaagccaagt rctaaggcag ccgaaataaa 3480 

aagct^tgaa acctaaaaaa aaaaaaaaaa aa 3512 



<210> 4 
<211> 890 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Asp Ser Asn Thr Ala Pro Leu 

1 5 
Pro Ala Pro Gin Pro Gin Ala Arg 
20 

Leu Glu Gin Glu Arg Ser Glu Arg 

35 40 
Gly Pro Gly Pro Gly Val Arg Asp 
50 55 



Gly Pro Ser Cys Pro Gin Pro Pro 

10 15 
Ser Arg Leu Asn Ala Thr Ala Ser 
25 30 
Pro Arg Ala Pro Gly Pro Gin Ala 
45 

Ala Ala Ala Pro Ala Glu Pro Gin 
60 
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485 








Val 


Val 


Leu 


Asn 


Val 


Ser 


Val 


Pro 








500 










Tyr 


Leu 


Phe 


Phe 


Arg 


Met 


Ala 


Gin 






515 










520 


Cys 


Tyr 


Leu 


Val 


Pro 


Tyr 


Leu 


Val 



23 



Arg 


Ala 


ASD 


Glv 


Thr 


Glv 


Pro 


Thr 






75 










80 






Val 


eu 


m 11 

u^u 


Arg 


Ala 


Lys 




on 










95 




Glu 


Val 


Glv 


Lvs 


His 


Tyr 


Leu 


Gin 












110 






Leu 


Asn 


Ser 


cys 


Thr 


Ala 


Val 




















uj.y 


Arg 




Glu 


Ala 


Val 


Lys 


Leu 


















Arg 


ui.y 


He 


Thr 


Ser 


Qlu 


Asn 


Glu 






XS9 










xou 


viXU 


inr 


Asp 


Leu 




Arg 


Ala 


Val 




170 










175 




Trp 
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870 








875 








880 


Phe 


Phe 


Phe 


Phe 


Pro 


Phe 


Leu 


Ser 


Ala Ala 




















885 








890 
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Pro 


He 


Leu 
575 


Val 


Ala 


Gly 


Leu 


AX a 


Leu 


Met 


Gly 


Thr 


vax 




Fne 


Ala 


Arg Trp 


Phe 


Leu 








5B0 










585 










590 






Ser 


Leu 


Asp 


Leu 


Thr 


Lys 


xie 


Met 


vax 


Thr 


Thr 


vax 


He 


Cys 


Gly Val 






595 










600 










605 








Pro 


Leu 
610 


Leu 


Phe 


Arg 


Trp 


Trp 
615 


Thr 


Lys 


Ala 


Asn 


Phe 
620 


Ser 


Val 


Met 


Gly 


Met 


Val 


Lys 


Ser 


Leu 


Thr 


Lys 


Ser 


Ser 


Met 


Val 


Lys 


X«eu 


He 


Leu 


Val 


625 










630 










635 










640 


Trp 


Leu 


Thr 


Ala 


He 


Leu 


Leu 


Phe 


Cys 


Trp 


Phe 


Tyr 


Val 


Tyr 


Arg ser 










645 










650 










655 




Glu 


Gly 


Met 


Lys 


Val 


Tyr 


Asn 


Ser 


Thr 


Leu 


Thr 


Trp 


Qln 


Qln 


Tyr Qly 








660 










665 










670 






Phe 


Leu 


Cys 
675 


Gly 


Pro 


Arg 


Ala 


Trp 
680 


Lys 


Glu 


Thr 


Asn 


Met 
685 


Ala 


Arg 


Thr 


Qln 


He 


Leu 


Cys 


Ser 


His 


Leu 


Glu 


Gly 


His 


Arg 


Val 


Thr 


Trp 


Thr Gly 




690 










695 










700 










Arg 


Phe 


Lys 


Tyr 


Val 


Arg 


Val 


Thr 


Glu 


He 


Asp 


Asn 


Ser 


Ala 


Glu 


Ser 


705 










710 










715 










720 


Ala 


He 


Asn 


Met 


Leu 
725 


Pro 


Phe 


Phe 


Leu 


Gly 
730 


Asp 


Trp 


Met 


Arg 


Cys 
735 


Leu 


Tyr 


Gly 


Glu 


Ala 
740 


Tyr 


Pro 


Ser 


Cye 


Ser 

745 


Ser 


Gly 


Asn 


Thr 


Ser 
750 


Thr 


Ala 


Glu 


Glu 


Glu 
755 


Leu 


Cys 


Arg 


Leu 


Lys 
760 


Gin 


Leu 


Ala 


Lys 


His 
765 


Pro 


Cys 


His 


lie 


Lys 
770 


Lys 


Phe 


Asp 


Arg 


Tyr 
775 


Lys 


Phe 


Glu 


He 


Thr 
7B0 


Val 


Gly 


Met 


Pro 


Phe 


Gly 


Thr 


Asn 


Gly 


Asn 


Arg 


Gly 


His- 


"Glu 


Glu 


Asp 


Asp 


He 


Thr 


Lys 


785 










790 










795 










800 


Asp 


He 


Val 


Leu 


Arg 
805 


Ala 


Ser 


Ser 


Glu 


Phe 
810 


Lys 


Asp 


Val 


Leu 


Leu 
815 


Asn 


Leu 


Arg 


Gin 


Gly 


Ser 


Leu 


He 


Glu 


Phe 


Ser 


Thr 


He 


Leu 


Glu 


Gly Arg 








820 










825 










830 






Leu 


Gly 


Ser 

B35 


Lya 


Trp 


Pro 


Val 


Phe 
840 


Glu 


Leu 


Lya 


Ala 


He 
845 


Ser 


Cys 


Leu 


Asa 


Cys 
850 


Met 


Thr 


Gin 


Leu 


Ser 
855 


Pro 


Ala 


Arg 


Arg 


His 
860 


Val 


Lys 


He 


Glu 


Gin 


Asp 


Trp 


Arg 


Ser 


Thr 


Val 


His 


Gly 


Ala 


Leu 


Lys 


Phe 


Ala 


Phe 


Asp 


865 










870 










875 










880 


Phe 


Phe 


Phe 


Phe 


Pro 


Phe 


Leu 


Ser 


Ala 


Ala 















885 890 



<210> 6 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<400> 6 

tgtaaaacga cggccagtct cgtgcagaag gccgcgct 38 



<210> 7 



wo 00/18787 



27 

<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 7 

caggaaacag ctatgaccgc ccacagccac cgcgccac 
<2io> a 

<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 8 

tgtaaaacga cggccagtct gtctccagca gacactaa 

<210> 9 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<400> 9 

caggaaacag ctatgaccca caatgctgaa ctgcagag 

<210> 10 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer** 
<400> 10 

tgtaaaacga cggccagtct gaagaccctc atgccttg 

<210> 11 
<211> 38 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<400> 11 

caggaaacag ctatgaccac acttctccgt gggctgcg 

<210> 12 
<211> 38 
<212> DNA 

<213> Artificial Sequence 



wo 00/18787 
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2S 

<220> 

<223> /note- "synthetic construct - primer" 
<400> 12 

tgtaaaacga cggccagttc ggagaatctg gaggctga 38 

<210> 13 
<211> 38 
<212> OKA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 13 

caggaaacag ctatgaccca ttacaagctg ctcaaccc 38 

<210> 14 
<211> 38 
<212> DNA 

<213> Artificial Secjuence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 14 

tgtaaaacga cggccagtcg aaagccttcc aggcagag 38 

<210> 15 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 15 

caggaaacag ctatgaccct atgggaaggt cctggctc 38 

<210> 16 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<400> 16 

tgtaaaacga cggccagtct aggaacagtg cgccagtt 38 

<210> 17 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic . construct - primer" 



<400> 17 
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caggaaacag ctatgaccat ggagtcgcac aggaagga 3 8 

<210> 18 
<211> 38 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> /note« "synthetic construct - primer" 



<400> 18 

tgtaaaacga cggccagtgc ccatgctgtt ttctctca 3d 

<210> 19 
<211> 38 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> /note= "synthetic construct - primer" 
<400> 19 

caggaaacag ctatgacccc gaggacacat ccttatga 38 



<210> 20 

<211> 38 

<21~2> DMA 

<213> Artificial 



Sequence 



<220> 

<223> /noce» "synthetic construct - primer" 
<400> 20 

tgtaaaacga cggccagtcc tcgttcccac gtaccatc 3B 



<210> 21 

<211> 38 

<212> DNA 

<213> Artificial 



Sec[uence 



<220> 

<223> /note= "synthetic construct - primer" 
<400> 21 

caggaaacag ctatgaccgt agcagtaggt gcccttga 3 6 

<210> 22 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- '•synthetic construct - primer" 
<400> 22 

tgtaaaacga cggccagtcc tggtcgtcct caatgtca 3 8 

•* *<210> 23 
<211> 38 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 23 

caggaaacag ctatgaccca tagaaccagc agaacagc 

<210> 24 
<211> 38 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 24 

tgtaaaacga cggccagttg gttcacgtct ctggagct 

<210> 25 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 25 

caggaaacag ctatgaccga gttgtagacc ttcatgcc 

<210> 26 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<40Q> 26 

tgtaaaacga cggccagtgg gcatgaaggt ctacaact 

<210> 27 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 27 

caggaaacag ctatgaccga acttcttgat gtggcagg 

<210> 28 

<21X> 38 

<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> /note« "synthetic construct - primer" 
<400> 2B 

tgtaaaacga cggccagtct ggatgcgctg cctctacg 

<210> 29 
c211> 38 
<212> DUA 

<213> Artificial Sequence 
<220> 

c223> /note- "synthetic construct - primer" 
<400> 29 

caggaaacag ctatgacctc aggccgccga caggaatg 

<210> 30 

<211> 38 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> /note- "synthetic construct - primer" 
<400> 30 

tgtaaaacga cggccagttc gccttcgact tctttttc 

<210> 31 
<211> 38 
<212> DNA 

<2X3> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 31 

caggaaacag ctatgacccc aaacaaataa gaaatgct 

<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
-c220> 

<223> /notes "synthetic construct - primer' 

<400> 32 
gcccagctct cgcccaccag 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer 

<400> 33 
tcaggccgcc gacaggaatg 
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<210> 34 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 34 

cctggtcgtc ctcaatgtca 20 

<210> 35 
<2I1> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct - primer" 
<400> 35 

ggtagggcac aaggtagcag 20 

<210> 36 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer* 
<400> 36 

gggcatgaag gtctacaact cca 23 

<210> 37 
<211> 36 
<212> DKA 

<:213> Artificial Sequence 
<220> 

c223> /note- "synthetic construct - primer" 
<400> 37 

ccgtagaggc agcgcatcca gtcgccgacc tagaac 36 

<210> 3B 
<211> 20 
<212> DKA 

c213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct - primer" 
<400> 38 

gagggcatga aggtctacaa 20 

<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> /notes "synthetic construct 

<400> 39 
cccacggtaa tctcaaactt 

<210> 40 
<2I1> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> /note« "synthetic construct 

<400> 40 
tagtgtgccc ctgctgttgc 

<210> 41 
<211> 20 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> /note- "synthetic construct 

<400> 41 
cccacggtaa tctcaaactt 

<210> 42 
<211> 20 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> /notes "synthetic construct 

<400> 42 
cggtttcgga gcaacttcgc 

<210> 43 

<211> 20 

<212> DHA 

<213> Artificial Sequence 
<220> 

<223> /note= "synthetic construct 



- primer" 

20 

- primer" 

20 

- primer" 

20 

- primer" 

20 

- primer" 



<400> 43 
cacctcagcc tcgttctcag 



20 
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